Posts categorized "EuroFedora2005"

September 30, 2005

Euro Fedora User Meeting 2005 - presentations are up

Most of the presentations from the European Fedora [repository] User Meeting 2005 are now up.
They are linked as e.g. (ppt) within the program.

Also, as I mentioned previously, most of the presentations from Building the Info Grid 2005 are up as well, on the speakers and abstracts page.

September 28, 2005

Euro Fedora User Meeting 2005 - Fedora as an OAI-PMH (and WS) Compliant Data Provider

Fedora as an OAI-PMH (and WS) Compliant Data Provider (PowerPoint)
Ana Macario, Alfred Wegener Institute

* very data-centric organization
* involved in DataGrid

* SOA at AWI
[diagram of 2004]

In practice...
post-print -> repository
PI is supposed to release data when published, but by then, it is lost or there are excuses

So need Staging -> Publication

So: Fedora to do all this.

* Reasons for Fedora
- Virtual Repository
- not restricted to Dublin Core
- standards compliant
- etc.

[diagram of 2005]

currently using dc.source and dc.relation as a hack to express linkages,
convinced that proper way to do this is RDF/XML

Long-term issues
* Benchmarking for large number of files
* out-of-box web client acceptance
* fine-grained access control and Shibboleth based AuthN - relevant in DataGRID
* support for sets
* federation model
* collaboration and support
- disseminators for visualizations services; relevant for DataGrid
- Eclipse project to facilitate plugin devel
- Google strategy?
- seminars, tutorials for advanced Fedora users

Euro Fedora User Meeting 2005 - Introducing Pergamos: A Fedora-based Digital Library System utilizing Digital Object Prototypes

Introducing Pergamos: A Fedora-based Digital Library System utilizing Digital Object Prototypes (PowerPoint)
Kostas Saidis

- provide the means to treat content variations in a unified manner

- for digital object they want behaviors according to nature: if it is a book, give us the pages, we
don't care about the encoding

Digital Object Typing Info

* Humans interpret Content Models (in this case)
* problems: tool weaknesses

Preferred: DL should resolve DO type automatically, e.g. force conformity

DO Object Prototypes, based on OO methodology
- content
- private behaviors
- public behavior

Euro Fedora User Meeting 2005 - Fedora Content Models for the National Science Digital Library Data Repository (NDR)

13:15

Fedora Content Models for the National Science Digital Library Data Repository (NDR) (PowerPoint)
Carl Lagoze
Cornell Information Science

... Information Network Overlay

a digital library (particularly science DL) is more than just documents: it's data, orgs, people, services

all (primary resource) objects named with handles in overlay

... including Access Controlled API

interested in very rich granualarity and access control

some of this is still theoretical, not in production yet

1 million metadata records in Oracle (Dublin Core) will = a digital object in Fedora
there are ancillary objects around the metadata = about 2.1 million objects

NSDL is not a content library, it is a metadata library
but it will be possible to add content e.g. lesson plans

each object turns into about 60 triples = about 60 million? 120 million triples

NDR currently one Fedora repository, will become ? federated repositories

want to build representations of people in the NSDL, so that you can build a reputation system for those people
"this person is expert in X"

can use proai to expose using oai anything that you can connect with a disseminator?

make sets using aggregation

Euro Fedora User Meeting 2005 - Researching Fedora to Serve as Central Repository System of the State and University Library

Researching Fedora to Serve as Central Repository System of the State and University Library
Stephan Drescher (PowerPoint)
and Birte Christensen-Dalsgaard (PowerPoint)

* National Library - digital preservation
* Storage Preservation
- as many different systems as different objects (currently)
- including both digitized and born-digital (IR and Webarchive)

working on automatic ingest
working on national infrastructure for institutional repositories, with infrastructure in charge of
preservation - national infrastructure for storage with redundancy etc.

Preferred preservation strategy: migration

working with Royal Library in Holland who have an emulation program running with IBM

Stephan: Archiving of Denmark's Broadcasting of Radio and Television (BART)

* covering 24/7/365
* 220 GB a day
* data needs to be evaluated and eventually corrected after 48 hours
* automatically ingested into repository

How to fit Fedora within Bart's resource workflow

Uses Linux and off-the-shelf.

80-100 TB a year

Q: are you going to put this data into the repository?
A: into the repository only references (to the raw data) will be input

Euro Fedora User Meeting 2005 - DiPP Digital Peer Publishing

Digital Peer Publishing
DiPP Fedora-based system for Open Access eJournals (PowerPoint)
Jochen Schirrwagen
hbz

http://www.hbz-nrw.de/
http://www.dipp.nrw.de/

Mission
- foundation of new and expansion of existing scientific electronic journals
- customizable workflow within a common publication system [including peer review]
- fast, open and transparent digital peer publishing

Components and Middleware
- Peer Review System (external component)
- from Peer Review System via LDAP auth to Plone-based Publication System
- from Peer Review System via OAI to DiPP Services Fedora-based Repository
- from DiPP to Publishing system via SOAP

currently using Fedora 1.2

challenge: external peer review, Plone-based publishing
want: at least appearance of one unified system

Publication Pipeline: includes Repository Engine, Conversion Engine, Publication Engine
therefore DiPP Services
* Conversion Service
- uses commercial tool to get to DocBook XML, then XSLT to transform to HTML and even PDF
(convert XML to LaTeX using XSLT then to PDF?)
* URN service
- persistent identifier for journals, articles, supplementary material
* Distribution Service
- registration for OAI harvesting
- RSS feeds
- email alerts

also had to build Hierarchy Service because of limitations in Fedora version 1

Fedora - is it good enough?

pros
- highly modularized
- versioning of datastreams
- addition of own metadata formats

cons
- added metadata in own formats not searchable
- inconsistent versioning of API-A (Access) and API-M (Management) from Fedora 1 to Fedora 2

would be good to have a pool of common services

Summary and Outlook
- workflow-based publication system
- extended services on top of Fedora

Next Steps
- DINI certification (German centre for network information)
- full-text indexing via FAST search engine
- improvement of usability

dipp@hbz-nrw.de

Euro Fedora User Meeting 2005 - Institutional Repository In A Box

Institutional Repository In A Box (PDF)
Christian Tønsberg

DTU uses local (specialized) version of National Research Database

Current state:
- Data Production App: MetaToo
- Storage
- Indexer (zebra)
- Web Front (HTTP, java/tapestry, sru/srw)

want to talk to Fedora instead of National Research Database software
- generalize Data Production Application
- generalize Web Front
- should work out of the box

CVT Project Fedora2 (IR-in-a-box)
http://defxws.cvt.dk/projects/fedora2/

Euro Fedora User Meeting 2005 - 09:20 - Fedora Project Update

Fedora Project Update (PowerPoint)
Sandy Payette

version 2.1 of Fedora embodies the full core system they had envisioned

Fedora for Digital Archives and Records Management

interest in RDF capabilities

designed so that modules can be replaced

improved OAI provider service coming in 2.1, can also be used separately

objects stored as FOXML (Fedora Object XML)

variety of ingest/export formats
- FOXML, METS, METS 1.4, MPEG21 DIDL

Fedora 2.1
- introduce Fedora Service Framework (services distributed as part of the official distribution)
guarantee compatibility as Fedora evolves
- intro PROAI (OAI Provider Service) ? on sourceforge now ?
- intro directory ingest service

Authentication and Authorization improvements

* AuthN plugins
- HTTP basic auth
- Tomcat realms and login modules

* SSL

* AuthZ
- XML-based policies using XACML

developing a user interface for policy building

RDF-based Resource Index (RI)

* ontology of common relationships
* stored in Resource Index
- automatic index (using Kowari triple-store)

* RI search (search the repository as a graph)

* new in Fedora 2.1 for Resource Index
- scale and performance testing (NSDL 2 million objects, > 100 million triples)

RDF relaxed rules now permits relating to objects outside a specific repository.

There is a plan for federation and naming services.

improved logging using log4j

handle system plugin for PID generation

you can rebuiild the entire repository by crawling the FOXML,
there is now a Rebuild Utility for Repository Indices

FedoraClient utility class for building new (SOAP) clients

Fedora Future

2006
- better interface: FIRE client
- BPEL workflow engine
- advanced Fedora search
- preservation integrity
2007
- federation pid resolution
- preservation monitoring
- event notification (pub/sub)

Pathways InterDisseminator
- OpenURL access point

- "content model" specification language
- advanced object creation workbenches
- Shibboleth and Web Services security

Community Working Groups

- Preservation (Rutgers)
- Workflow (Peter Murray, OhioLink)
- Outreach (Rutgers)
  = improve Fedora web site
  = collaboration environment - Wiki, Confluence, other?
- Content Model Working Group (under charter)

There is a Fedora Advisory Board.
- define sustainability model

* there are collaborative development opportunties

* will put up a page on http://www.fedora.info/ for user-contributed tools

September 19, 2005

Building the Info Grid 2005

In one week I will be at Building the Info Grid 2005 - Digital Library Technologies and Services.
It lines up well with my interests as it will be covering both Service-Oriented Architectures as well as Shibboleth-based distributed authentication.

I didn't really know much about Copenhagen (København), Google Earth was a big help to me in figuring out the lay of the land.

I made a placemark for the Copenhagen Business School:

Download CopenhagenBusinessSchool.kmz

I also found this map (PDF) of some hotels associated with the conference to be useful.

I have gathered a few bookmarks together under http://del.icio.us/rakerman/copenhagen

Following the Info Grid conference I will be at the European Fedora User Meeting.
(Note: That's the Fedora repository software, not the operating system.)

I made a placemark for building 101A at the Technical University of Denmark (DTU):

Download TechnicalUniversityDenmark-Lyngby.kmz

I also found this map (PDF) useful for figuring out the bus stops on the DTU campus.

I think I will have good Internet connectivity, I plan to blog the conferences under categories/tags InfoGrid2005 and EuroFedora2005.

I will have instant messaging on if you want to contact me, there is more info linked from my conferences page.

(Note: Due to TypePad MIME type issues, you may have difficulties downloading the placemarks.  In Firefox, right-click and use Save Link.)

----

Search


  • Google
    Web scilib.typepad.com

Receive via Email



  • Powered by FeedBlitz

Twitter Updates

    follow me on Twitter

    Furl Linkblog

    Resources

    Recent Comments

    Referral

    StatCounter

    Googlytics

    Technorati

    Blog powered by TypePad
    Member since 11/2004