@amazon-science

Amazon Science

Popular repositories

  1. mm-cot Public

    Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

    Python 3.9k 329

  2. chronos-forecasting Public

    Chronos: Pretrained Models for Probabilistic Time Series Forecasting

    Python 3.5k 407

  3. auto-cot Public

    Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

    Jupyter Notebook 1.9k 171

  4. patchcore-inspection Public

    Python 950 186

  5. RAGChecker Public

    RAGChecker: A Fine-grained Framework For Diagnosing RAG

    Python 942 80

  6. siam-mot Public

    SiamMOT: Siamese Multi-Object Tracking

    Python 485 61

Repositories

Showing 10 of 397 repositories
  • application-eval-data
    0 0 0 0 Updated Jul 22, 2025
  • calfwsat Public
    C 0 MIT-0 0 0 0 Updated Jul 22, 2025
  • carbon-assessment-with-ml Public

    CaML: Carbon Footprinting of Household Products with Zero-Shot Semantic Text Similarity

    Jupyter Notebook 49 Apache-2.0 10 0 1 Updated Jul 21, 2025
  • TurboFuzzLLM Public

    TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice

    Python 11 Apache-2.0 1 0 0 Updated Jul 21, 2025
  • MixtureOfAdapters
    Jupyter Notebook 0 Apache-2.0 0 0 0 Updated Jul 18, 2025
  • SWE-PolyBench Public

    SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents

    Python 56 MIT 5 1 0 Updated Jul 16, 2025
  • wraval Public

    WRAVAL helps evaluate LLMs on writing-assistant tasks such as summarization, professional tone, and witty tone.

    Jupyter Notebook 5 2 0 10 Updated Jul 15, 2025
  • llm-asymptotic-decoding
    Jupyter Notebook 10 0 0 8 Updated Jul 15, 2025
  • CiteEval Public

    Official repository for CiteEval: Principle-Driven Citation Evaluation for Source Attribution

    Python 1 0 0 2 Updated Jul 13, 2025
  • fmcore Public

    Running GenAI models at every scale, on every modality

    Python 6 Apache-2.0 1 5 3 Updated Jul 11, 2025