haschart edited this page Sep 22, 2016 · 12 revisions

This is a complete rewrite of SolrMarc. It started from code written by Oliver Obenland, who devised the design and wrote the initial code around which this rewrite was built.

The goal of the design is a program that operates much the same as earlier versions of SolrMarc: it can process index specifications that worked with previous versions and produces substantially the same Solr records. Further goals are to run much faster and to support a richer superset of features in the index specification language.

Some of the speed-up comes from "compiling" the index specification into an internal format once, and then applying each of the resulting "compiled" Indexer objects to the MARC records being processed. Each "compiled" Indexer object consists of an Extractor to extract data from the MARC record being processed, zero or more Map objects to transform and filter that data, and a Collector object to control how the data is added to the Solr Document.
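The Extractor, Map, and Collector arrangement described above can be sketched roughly as follows. This is only an illustration of the idea, not SolrMarc's actual classes: the interface names, method signatures, the `title_display` field name, and the use of a plain `Map<String, List<String>>` to stand in for both the MARC record and the Solr document are all assumptions made for the example. The transform step is called `ValueMap` here only to avoid a clash with `java.util.Map`.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class IndexerSketch {
    /** Pulls raw values out of a record (hypothetical: a record is just tag -> values). */
    interface Extractor {
        List<String> extract(Map<String, List<String>> record);
    }

    /** Transforms or filters one value; returning null drops the value. */
    interface ValueMap {
        String apply(String value);
    }

    /** Decides how the surviving values are added to the output document. */
    interface Collector {
        void collect(String field, List<String> values, Map<String, List<String>> doc);
    }

    /** A "compiled" indexer: built once, then applied to every record. */
    static final class Indexer {
        private final String solrField;
        private final Extractor extractor;
        private final List<ValueMap> maps;
        private final Collector collector;

        Indexer(String solrField, Extractor extractor, List<ValueMap> maps, Collector collector) {
            this.solrField = solrField;
            this.extractor = extractor;
            this.maps = maps;
            this.collector = collector;
        }

        void index(Map<String, List<String>> record, Map<String, List<String>> doc) {
            List<String> values = new ArrayList<>();
            for (String raw : extractor.extract(record)) {
                String value = raw;
                // Run the value through each map in order; stop once it is filtered out.
                for (ValueMap map : maps) {
                    if (value == null) {
                        break;
                    }
                    value = map.apply(value);
                }
                if (value != null) {
                    values.add(value);
                }
            }
            if (!values.isEmpty()) {
                collector.collect(solrField, values, doc);
            }
        }
    }

    /** Builds one illustrative indexer: field 245, trimmed, lowercased, first value only. */
    static Indexer titleIndexer() {
        Extractor fromTag245 = record -> record.getOrDefault("245", List.of());
        ValueMap trim = String::trim;
        ValueMap dropEmpty = v -> v.isEmpty() ? null : v;
        ValueMap lower = v -> v.toLowerCase(Locale.ROOT);
        Collector firstOnly = (field, values, doc) -> doc.put(field, List.of(values.get(0)));
        return new Indexer("title_display", fromTag245, List.of(trim, dropEmpty, lower), firstOnly);
    }

    public static void main(String[] args) {
        Map<String, List<String>> record = new LinkedHashMap<>();
        record.put("245", List.of("  ", " Moby Dick ", "The Whale"));

        Map<String, List<String>> doc = new LinkedHashMap<>();
        titleIndexer().index(record, doc);
        System.out.println(doc); // prints {title_display=[moby dick]}
    }
}
```

The point of the sketch is that the `Indexer` is assembled once and then reused for every record, so the cost of interpreting the specification is not paid per record.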

Among the many changes in this new version are:

With all these changes, some features of previous versions are no longer available or no longer needed:

  • The release is no longer combined into a One-Jar file. There is now just a single normal jar file, plus a small list of dependencies.
  • The release no longer supports writing directly to Solr index files. It only builds SolrInputDocument objects and sends them to an already running Solr server.
  • The project no longer includes "example" builds. They will still exist, but as separate GitHub repositories.