Skip to content

EdWarga/IRandXQuery

Repository files navigation

XQuery & IRs

Institutional Repository (IR)

DSpace

  • Open Source IR solution
  • all data managed by SQL db (postgres
  • spits out XML here and there
    • OAI-PMH - expose metadata records
    • XMLUI - interface based on XML views transformed to HTML via XSLT

IR Workflow

  • identify faculty scholarship (journal articles)
  • collect bib records (normalize? metadata)
    • MS access; Zotero
  • get permissions from copyright holder to deposit in IR
    • agree to terms in IR license
    • which version can we use - PrePrint, PostPrint, PublishedPDF
  • collect and prepare content (articles)
  • submit content we have permissions for to IR
  • catalogers review, edit, and accept submissions
  • mint handle (persistent URI to item page in IR)
  • add handle and add bitstream URL to bib record
    • query IR metadata (SOLR?, OAI-PMH), extract this data, add to bib records

Tracking Permissions

  • track individual publications/articles
    • export MODS records from Zotero
      • example: NewClearedList-MODS.xml
  • add permissions data to bib records (MODS+Permissions)
    • create namespace and element set to express permissions data if there are no suitable MODS elements
  • Permissions data elements
    • Publisher
    • Journal Title
    • Clearance - (Y/N)
    • Version (P,P,orP)
    • Embargo
    • Blanket Permission (Y/N)
    • Conditions
  • store bib records in eXist-db
  • use database to track permissions for individual items
  • use XQuery to query collection
    • view articles cleared for deposit
    • other administrative views 

Bibliography Service

  • artifact of IR workflow is collection of faculty bibliographies (see bib records above)
  • use XQuery to transform bib records
    • what form do I need to put the bibliographies on the web?
    • library website
    • Vanderbilt faculty website
    • give to faculty for use on personal site
    • what else?
  • transform to document - PDF?

IR Administration

  • produce metadata for ingest from bib records
    • csv for batch import
  • pull XML data about existing content items in IR
    • OAI-PMH
      • LawFacultyOAIMets_2014-02-19.xml
      • ReturnMods.xquery
    • scrape repository pages (download stats)
      • download count example - ExtractDownloadCount.xquery
    • SOLR?
  • use XQuery to compile information about them
    • usage report example
  • metadata quality
  • parse data from existing items and add to bib records
  • can I do what I need to with the IR's XML data, or do I still need to access data from postgres occasionaly? -or- even with the NoSQL toolbox, I still need SQL

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages