@nla

National Library of Australia

Popular repositories

  1. outbackcdx Public

    Web archive index server based on RocksDB

    Java 31 20

  2. httrack2warc Public

    Converts HTTrack crawls to WARC files

    Java 30 6

  3. chropro Public archive

    Chrome debugging protocol client for Java

    Java 10 2

  4. solrbackup Public

    Python script for backing up a remote Solr 4 core or SolrCloud cluster

    Python 9 6

  5. chronicrawl Public archive

    Experimental continuous web crawler for web archiving

    Java 9

  6. bamboo Public

    Web archive collection manager

    Java 8 4

Repositories

Showing 10 of 75 repositories
  • nla-blacklight_common Public

    Common functionality for Blacklight and ArcLight applications

    Ruby 0 0 0 3 Updated Oct 2, 2024
  • nla-blacklight Public

    Discovery application for the National Library of Australia's catalogue

    Ruby 0 1 0 8 Updated Sep 30, 2024
  • nla-arclight Public

    Custom implementation of ArcLight for The National Library of Australia.

    Ruby 0 0 0 8 Updated Sep 30, 2024
  • ai-scout-imageSearchComparison Public

    Simple website to capture evaluation of different ways to search images.

    EJS 1 0 0 6 Updated Sep 26, 2024
  • ai-scout-entireCanberraTimes Public

    AI newspaper search proof of concept - all 3.1m CT articles 1926-94, build SOLR index with nomic embeddings, exploratory web interface with LLM summaries

    JavaScript 0 0 0 3 Updated Sep 20, 2024
  • ai-scout-audio2 Public

    AI audio proof of concept #2 - read TEI transcripts, build SOLR index with nomic embeddings, exploratory search and delivery web interface

    JavaScript 0 0 0 6 Updated Sep 20, 2024
  • ai-scout-pictures Public

    AI pictures proof of concept - crawl blacklight, build SOLR index with CLIP embeddings, exploratory web interface

    HTML 1 1 0 6 Updated Sep 11, 2024
  • heritrix3 Public Forked from internetarchive/heritrix3

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    Java 0 777 0 0 Updated Sep 10, 2024
  • outbackcdx Public

    Web archive index server based on RocksDB

    Java 31 Apache-2.0 20 18 0 Updated Sep 9, 2024
  • pandas4 Public

    Web archive workflow system

    Java 3 Apache-2.0 2 16 0 Updated Sep 4, 2024
