1 Introduction
In recent years a number of Information Centers, Documentation, Archives, Library have launched a relentless work of digitization of information resources for later broadcast on the Internet. These processes have sued a number of tools aimed at resolving the management of digital resources and publishing on the web, serving a growing number of users.
response, some institutions in this environment have been inundated with offers of services designed to locate online digital repositories, with a varied price range and systems based on very different platforms and programs.
And so we have internet service newly created, from simple OPAC catalogs, photo databases, to large collections of digital documents offered by archival institutions responsible for them.
If surf internet regularly for information centers than we will have established the existence of portals, websites, or databases, which describe themselves as digital files. In many cases, they are not. For there to be a Digital Archive as a repository must give at least the following features:
-
Access is via a computer that connects Digital File content using communication networks. Even today there are other devices such as next generation mobile, or E-Books, able to connect to these services.
-
The Archive has its own repository, consisting of the documentation produced and received by an institution or person, throughout its history. The documentation has been previously selected, classified, described, and processed to allow later retrieval by performing searches for users or machines.
- File
materials included in the repository have in common: their treatment (the use of certain software and tools) and processes prior description, (through standards, league tables, labels, descriptors, controlled languages, fields, metadata, ...).
-
All documents in the repository are interrelated, forming a group that identifies the file online.
-
The repository is intended to present the documentary from the archive, to meet the information needs of some users.
Meet these characteristics and achieve success in creating a digital repository requires the use of technological tools that work properly and offer quality services.
There are not many programs for the creation of digital repositories, and in many cases, proprietary software is designed based on intuition and experience of developers, and with luck, with the collaboration of archivists and other professionals responsible for documentation. Among the wide range of programs, we have the options of free software programs or business license.
is not a debate about free software, but anyone responsible for developing a digital repository must keep in mind:
-
What programs we have available, both free and proprietary software to meet our needs.
-
Potential acquisition costs, license renewal, and adaptation to our project of proprietary software.
-
Potential costs of adaptation to free software project.
-
security and independence that offer various programs, access, use and control of the repository, and programming languages.
-
The experiences of other users.
-
And if we plan to develop our own digital repository management system by creating new software, we must be certain to improve all existing programs, with lower investment costs to buy or customize the software available.
One of the main advantages of free software for the commercial, is the ability to adapt to the needs and expectations of each institution. Proprietary software offers a tool that can only be used as is, "canned" with no possibility of changes, improvements, or adaptation to our needs. The free software we can modify and adapt to new tasks or eliminating those that do not interest us, to be a tool of the Archives. If the program is not fully adapted to the needs of the institution, who will it mold to the program, causing serious risks and problems.
addition, changes made to free software to suit your project, make that improves the program itself, innovate, and even distributed to other users freely growing steadily with the support of everyone. And so it becomes a program that not only meet specific needs (which usually offers the proprietary source software), but many, varied, different, and different. For this reason free software may be the only alternative to development and innovation in the Archives, Libraries and other information centers.
This text does not address theoretical aspects of the Archives nor does it reflect the creation, digitization, description, conservation, or preservation of repositories, as it focuses on the use of free software tools Drupal and Greenstone (both GPL) to create an institutional repository a file . The final section collected various resources to learn more about using Drupal and Greenstone.
2 File, Greenstone, and Drupal: Tools for creating a repository.
When it comes to files or digital libraries, discuss the term implies that brings together both: digital repository. A digital repository is the collection of information, documents, and data collected, managed, processed, and available through the use of electronic infrastructure to service users that meet a specific profile of access and use.
then defined and explained various items on Drupal , Greenstone and File to show the roles of each in developing the repository.
2.1 Definition of each item.
| Drupal | Greenstone | File |
| CMS Content Manager | manufacturer of digital repositories
| Documents Joint |
A) Drupal www.drupal.org
Drupal is a Content Management System (Content Management System) module based CMS deployable, and configurable to allow you to display and manage all types of information. From the Web Drupal is defined as a structure Content Management, Content Management Framework (CMF). As opposed to a current CMS, Drupal focuses on the ability of customization by the administrator, rather than the default options of a CMS.
Drupal publish all kinds of information and tools (articles, images, surveys, blogs, forums, e-commerce ...) with a simple content management system, users, and permissions. Its history dates back to 2000 when two students University of Antwerp decided to find a way to share their notes and materials on the Internet, and discussing and working together. Years after the project has Drupal users and developers in almost all countries, and even has many companies that are dedicated exclusively to offering development services with this program.
Drupal CMS is a dynamic, content and offers sample is stored in a database that responds to users through a Web environment. Known for the quality of its services, strong community and attractive pages that generates a fairly simply.
The design of Drupal is especially suitable for building and managing all types of online portals such as corporate websites, or personal. It is also a tool for continuous process improvement to adapt to change and Internet trends.
Portal Archives & Special Collections "of Dickinson College developed with Drupal. Http://itech.dickinson.edu/archives/
B) Greenstone www.greenstone.org
Greenstone as their official website is a collection of software programs designed to create and deliver digital collections, providing a new way to organize and publish Internet information through . Greenstone consists of several tools for the administrator to develop digital collections easily and independently.
The Greenstone philosophy is to create an open source software to put in the public domain all materials, and develop universal access to culture. This is his rationale.
The history of Greenstone begins in 1995 when the University of Waikato (New Zealand) presents a digital library project based on the full-text indexing of electronic documents. Years later, the project Greenstone is one of the most effective tools for the development of digital libraries worldwide. In 2007 according to statistics from the developers, the program is used in over 70 countries, is downloaded an average of 150 times a day, runs on most common operating systems (there is even iPod version) and its interface is available in 40 languages. Success? Its simplicity of use, and especially its results.
Another success factor is the support that has Greenstone by UNESCO as a reference tool for the creation of digital libraries worldwide, and especially for developing countries development. The latest recognition of this program was in December 2008 when he won the award for technological collaboration of the American Foundation Andrew W. Mellon MATC.
as the BBC, the University of the Balearic Islands, foundations, or NGOs, have known the benefits of this program.
should be noted that although Greenstone, is presented as a tool for creating digital libraries, most users will use to develop digital repositories regardless of content, it is also a tool that can serve perfectly to make accessible a file on the Internet.
Portal Estela. Arxiu Digital Canovelles "developed with Greenstone. Http://estela.canovelles.cat/cgi-bin/library
C) File
A file can be defined briefly as the set of documents generated and collected by an institution or person throughout his life, during the performance of the activities of its own. The origin and formation of the files is given by a management activity or natural, practical and administrative, in any case by creating libraries. Activities and functions are too broad For example, service to the institution which has generated documentation, and also society in the case of public records or historical.
For our purposes, it is to develop a repository, the file must be scanned if it is traditional media (print, audio tapes, film media ...) or have a set of digital documents produced originally this support by the institution.
The materials will be included in the repository must meet certain technical conditions such as resolution, file formats, or size and legal requirements access rights, data protection, and even intellectual property.
2.2 Programme and elements needed to implement the repository.
| Drupal | Greenstone | File |
|
Apache MySQL PHP
Drupal | Program Package Greenstone | Scanned documents in predefined formats and to comply with legislation. |
A) To start Drupal:
For Drupal is required to operate these programs, also developed as free software:
-
Apache: Server Program, or well, but ill-advised a server IIS (Microsoft ).
-
MySQL : Program database manager that will manage with phpMyAdmin.
-
PHP: A programming language that allows, among other things, to create dynamic web.
-
Drupal: Content Management System CMS.
Some utilities offer dedicated servers Apache, MySQL, PHP between their services, otherwise require installation. If we decide to install Drupal on your own server (under Linux or Windows ) is recommend to install a package that encompasses all programs to save jobs, some possibilities:
-
Easy PHP http:/ / www.easyphp.org / index.php
-
WAMPP http://www.wampserver.com/en/
-
XAMPP http://www.apachefriends.org/
These programs only should be added Drupal, which can be downloaded from http://drupal.org/project/drupal
B) To work with Greenstone
Greenstone includes all programs and utilities necessary for operation and management (including Java Runtime Environment JRE, ImageMagick, Ghostscript, and Perl ). Note that Greenstone at the moment is oriented to work on their own servers, ie a computer at the institution connected to Internet or intranet with adequate bandwidth to allow access. Although we have tested installing Greenstone on dedicated servers or external, may encounter different problems, and also the storage space is needed, costs can rise enough. Greenstone available in http://www.greenstone.org/download
C) File documents
mentioned above, to develop the repository requires a series of digital documents in appropriate formats that meet certain technical and legal conditions. The formats we can use to include in Greenstone are varied: XML, MARC, CDS / ISIS, ProCite, BibTex, Refer, OAI, DSpace, METS, PDF, PDF / A, Word, RTF, HTML, ODT, TXT , Latex, ZIP, Excel, PPT, Email, source code, GIF, JIF, JPEG, JPEG 2000, TIFF, MPEG-1, MPEG-2, MPEG-4, MPA, WMV, WMA, ASF, MP3, and QuickTime among the most prominent. Depending on the policy of the institution, will include one or more formats, we must also take into account the most used and distributed by users for ease of reference.
Regarding XML, noting that could include Greenstone File descriptions based on EAD, EAC, or EAF, forming a series of parameters.
2.3 Functions of each item
| Drupal | Greenstone | File |
| offer any information about the Archive: activities, news, and also disclose their funds. | Manage Digital Repository (document) | provide the documents to include in the repository |
A) Drupal
As explained is a program Drupal Content Management System (CMS), and utilities we can use to design the network in the Archive as an institution and its content in many ways. They indicated
-
corporate image of the institution, including the history of the Archives, their finding aids, user guides, location map, services, or organization.
-
Broadcasting news related file or your environment, such as procurement, work performed, calendar of events, or improvements in services.
-
user discussion forums, research, or professional, public or restricted access by prior identification.
-
development of information literacy programs and outreach to groups unfamiliar with the Archives.
-
Create surveys and forms for users to think about the next resource to include in the repository. Allow
-
File improve positioning in Internet search engines.
-
File Linking with social networks to reach more users.
Ultimately create any type of content to help improve the image of the archive and digital repository on the Internet, facilitating the approach of users and providing its services and funding.
Some modules Drupal possible to develop digital repositories. Modules such as KnowledgeTree integration or Docman , and even tools like CCK and Viewer allow you to create the description fields to include documents. But it should be noted that although the integration of Drupal and the repository will be perfect, for large document collections modules can have unexpected results, and so far, updating the versions of both Drupal of modules, and even PHP itself can create serious problems with their work. The independence of Drupal and repository (Greenstone or other), to give greater assurance in its management and operation.
B) Greenstone.
Greenstone is the digital repository manager, it offers all the documents of the Internet Archive. Some of the activities to:
-
include digital documents in the chosen format (PDF, JPG, ...). ODT Describe
-
documents with certain norms and standards (ISAD-G, MARC ..). The description can be done individually, document by document, or describe the different groups together: the Fund, sections, subsections, and series. This last way is assigned to every resource included in each group the description itself automatically, thus developing a practical multi-level system.
-
way to organize the documents available to the user, identifying search patterns, the presentation of the description, or results, and navigation between the documents in the repository.
-
Publish documents online so that users can access and search the description of the document in full text, and even the inherent metadata files (size, dimensions, format or any other assignment.) .
Existen otros programas de código abierto destinados a la creación de repositorios digitales, con algunas funciones similares a Greenstone , como Fedora ( http://www.fedora-commons.org/ ) , o Dspace ( www.dspace.org ). Integrating Fedora or DSpace in Drupal is more advanced (http://drupalib DSpace .interoperating.info/node/205 and Fedora http://islandora.ca/) In the election of either program will depend on the needs and possibilities of the administrators of these services. Greenstone may be easier to install, configure, adapt and maintain the other two systems, but needs to be an assessment to determine the election. There are tools to migrate repositories developed Greenstone to DSpace, and vice . And even some institutions combine both ( William Staughton Collection. Http://www.aladin.wrlc.org/gsdl/collect/staughton/staughton.shtml)
version 2, has been relatively outdated with respect to user administration, official documentation, network maintenance, customization, or aspects of Web 2.0. Mainly due to have as its main objective the creation and publication of repositories in a simple, regardless of size. Hopefully version 3 of this program take into account new features, beyond the management and maintenance of the repository.
Moreover, this handicap may be a reason for Greenstone and Drupal pretty well be in complete harmony, freeing their tasks: Greenstone is a content manager, and Drupal raises a number of technical problems and maintenance, to become a today a tool for digital repositories. Both programs are easy to manage, and results are excellent independently, so if we join we can get a tool that fully suits our needs.
C) File :
The file will provide the documents and resources that form the digital repository at the time it will take care of proper administration, management and updating. The documents included will be determined by school policy, they can take into account the interests of potential users. You can also take advantage of any memorial or date highlighted to increase the repository with new resources related to such events, promoting the documents while the file, looking to attract new users beyond the scholar or professional researcher.
2.4 Assemble the File in Drupal and Greenstone.
Using Drupal opportunities and Greenstone in the Archive will provide and disseminate a wealth of information about the institution and documents to anyone interested in an attractive and easily.
To join with Drupal Greenstone can be done in two ways:
- By incorporating an iframe to website developed with Drupal. Thus fits Greenstone in CMS, and appears to be the same site, but operate independently. The repository should follow the same line of style and appearance used by Drupal.
The iframe code that can be inserted on any page of Drupal is:
\u0026lt;iframe scrolling = "auto" src = "http://direccion/biblioteca/greenstone/" style = border-style: hidden; width: 100%; height: 600px ;">\u0026lt;/ iframe>
- A more complex is to create a form in Drupal that describes the documents, and import them into Greenstone automatically, using XML or SOAP supported by perl ( Greentsone) and php (Drupal ). In this way both the management of the repository, such as your query is held entirely in Drupal. You should use modules to improve the presentation and use of information. On the other hand Drupal and Greenstone can be joined with the possibilities offered some modules ( nodewords, libraries, marc, OAI-PMH ... ), this idea of \u200b\u200bthe documentary consultant Oskar Calvo ( http://www.documentados.com ) unfortunately has not yet been implemented in a solid.
3 Examples.
In this site: Drupal is responsible for collecting institutional information ("Submission", "Help?" The action "...), and to promote their funds (..."), and Greenstone stress by offering the repository iframe to see the file. Bernardo Foundation archive Aladrén. Http://www.manuelalbar.org
Pictured Drupal is responsible for interacting with users surveys ( "part and choose" ) alert service and offer a RSS news aggregator ( Paragraph orange ) . Greenstone provides the repository, in this case a specialized library, in another iframe . My Virtual Library Natural Area http://miespacionatural.es/content/biblioteca-virtual
4 Some Sources and Resources
ALÓS-Moner, Adela. digital repositories: one concept, multiple views . [Online]. [Sl: Thinkepi], 2009. [Refers 2-2010] < http://www.thinkepi.net/repositorios-digitales-un-concepto-multiples-visiones >
Consultative Committee for Space Data Systems. Reference model for an Open Archival Information System. [Online]. Washington: CCSDS Secretariat, 2002. [Refers 01-2010] < http://public.ccsds.org/publications/archive/650x0b1.pdf >
[Panel IFLA and ICA]. Guidelines Digitization Projects collections and public funds, particularly those kept in libraries and archives. English translation. Madrid: Ministerio de Cultura, 2002. [Refers 01-2010] < http://www.mcu.es/archivos/docs/pautas_digitalizacion.pdf >
HEREDIA HERRERA, Antonia. What is a file? . Gijón: Trea, 2007. 135 pp. ISBN: 978-84-9704-306-9.
Tramulles, JESUS. Digital Library Greenstone . Published in: Tramullas, J. And Garrido, P. (Coords). Free software for digital information services . Madrid: Pearson Prentice Hall, 2006. ISBN 978-84-8322-299-7
Drupal
Official Website: www.drupal.org
Downloads: http://drupal.org/project/drupal
Drupal Hispanic Community: http://drupal.org.es/
Installing Drupal http://drupal.org.es/node/4530
Greenstone
Official Website: www.greenstone.org (Available in English)
Downloads: http://www.greenstone.org/download_es
All about Greenstone http://wiki.greenstone.org/
Greenstone Developer: http://trac.greenstone.org/
Greenstone Community English: http: / / gsdl-esdoc.berlios.de /
mailing list fully active (questions, suggestions, solutions
...):https: / / list. scms.waikato.ac.nz / mailman / listinfo / greenstone-users (All Users) http://www.freelists.org/list/greenstone_es (Users Hispanics)
------------------------
communication This outline was to be presented at the conference Barcelona OS Repositories (http://osrepositorios.uoc.edu/programa.html) . For lack of time could be improved, but at least has been for a post:).
0 comments:
Post a Comment