Skip to content
@amazon-science

Amazon Science

Popular repositories Loading

  1. chronos-forecasting chronos-forecasting Public

    Chronos: Pretrained Models for Time Series Forecasting

    Python 4.6k 542

  2. mm-cot mm-cot Public

    Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

    Python 4k 334

  3. auto-cot auto-cot Public

    Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

    Jupyter Notebook 2k 182

  4. patchcore-inspection patchcore-inspection Public

    Python 1.1k 210

  5. RAGChecker RAGChecker Public

    RAGChecker: A Fine-grained Framework For Diagnosing RAG

    Python 1k 85

  6. siam-mot siam-mot Public

    SiamMOT: Siamese Multi-Object Tracking

    Python 490 60

Repositories

Showing 10 of 435 repositories
  • Cyber-Zero Public

    Cyber-Zero: Training Cybersecurity Agents Without Runtime

    amazon-science/Cyber-Zero’s past year of commit activity
    Python 51 9 3 18 Updated Dec 30, 2025
  • amazon-science/CodeAssistBench’s past year of commit activity
    Python 5 Apache-2.0 1 1 0 Updated Dec 30, 2025
  • chronos-forecasting Public

    Chronos: Pretrained Models for Time Series Forecasting

    amazon-science/chronos-forecasting’s past year of commit activity
    Python 4,583 Apache-2.0 542 18 6 Updated Dec 30, 2025
  • CTF-Dojo Public

    Training Language Model Agents to Find Vulnerabilities with CTF-Dojo

    amazon-science/CTF-Dojo’s past year of commit activity
    Python 23 5 2 0 Updated Dec 29, 2025
  • carbon-assessment-with-ml Public

    CaML: Carbon Footprinting of Household Products with Zero-Shot Semantic Text Similarity

    amazon-science/carbon-assessment-with-ml’s past year of commit activity
    Jupyter Notebook 54 Apache-2.0 11 0 0 Updated Dec 23, 2025
  • MEMERAG Public

    MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation

    amazon-science/MEMERAG’s past year of commit activity
    Python 4 0 1 4 Updated Dec 23, 2025
  • amazon-science/LibEvolutionEval’s past year of commit activity
    Python 1 0 0 0 Updated Dec 22, 2025
  • SWE-PolyBench Public

    SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents

    amazon-science/SWE-PolyBench’s past year of commit activity
    Python 75 MIT 11 0 0 Updated Dec 18, 2025
  • amazon-science/MigrationBench’s past year of commit activity
    Python 13 Apache-2.0 1 0 2 Updated Dec 17, 2025
  • alexa-arena Public
    amazon-science/alexa-arena’s past year of commit activity
    Python 108 LGPL-2.1 11 1 3 Updated Dec 16, 2025