Skip to content
@rungalileo

Galileo

Evaluate, observe, and protect your GenAI applications

Pinned Loading

  1. agent-leaderboard agent-leaderboard Public

    Ranking LLMs on agentic tasks

    Jupyter Notebook 207 22

  2. hallucination-index hallucination-index Public

    Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.

    116 9

  3. sdk-examples sdk-examples Public

    Examples on how to get started with the Galileo SDKs for AI Evaluation and Observability (both in Python and Typescript)

    Python 14 8

Repositories

Showing 10 of 56 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics