Home

This is a completely re-written SolrMarc. It was started from code written by Oliver Obenland, who devised the design and wrote the initial code around which this re-write was built.

The goal of the design is a program which operates much the same as the earlier versions of SolrMarc, including being able to process index specifications that worked with previous versions and produce substantially the same Solr records. But with the further goals of operating much faster and supporting a richer superset of features in the index specification language.

Some of the speed-up achieved is due to "compiling" the index specification into an internal format once, and then applying each of those "compiled" Indexer objects to the MARC records being processed. The "compiled" Indexer objects basic architecture consists of an Extractor to extract data from the MARC record being processed, zero of more Map objects to transform and filter that data, and a Collector object to control how the data should be added to the Solr Document.

Among the many changes in this new version are:

Much richer index specification language
Ability to compile user-supplied custom methods at runtime
Ability to gather records and submit them to Solr in large chunks
Ability to run against multiple different versions of Solr without needing to be re-compiled
Ability to run multiple indexers in threads in parallel

With all these changes some of the features of previous versions are no longer available/no longer needed:

The release will no longer being combined into a One-Jar file. There is now just a single normal jar file, plus a small list of dependencies.
The release no longer will support direct writing to Solr index files. It merely builds SolrInputDocument objects and sends them to an already running Solr Server.
The project will no longer include "example" builds. They will exist but they will be separate GitHub repositories

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Home

Clone this wiki locally