Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve CRUD on scoring function llm-as-judge #1454

Open
yanxi0830 opened this issue Mar 6, 2025 · 1 comment
Open

Improve CRUD on scoring function llm-as-judge #1454

yanxi0830 opened this issue Mar 6, 2025 · 1 comment
Assignees
Labels
enhancement New feature or request
Milestone

Comments

@yanxi0830
Copy link
Contributor

yanxi0830 commented Mar 6, 2025

🚀 Describe the new functionality needed

We need to refactor llm-as-judge to make it easy for user to perform CRUD operations.

  • unregister scoring functions
  • persisting judge prompts
  • migrating simpleqa judge prompts to other repo

#1405

💡 Why is this needed? What if we don't build it?

Prepare for llama-stack-evals repo

Other thoughts

No response

@yanxi0830 yanxi0830 added the enhancement New feature or request label Mar 6, 2025
@yanxi0830 yanxi0830 self-assigned this Mar 6, 2025
@yanxi0830 yanxi0830 added this to the v0.1.7 milestone Mar 6, 2025
@yanxi0830
Copy link
Contributor Author

pending on #1580

@yanxi0830 yanxi0830 modified the milestones: v0.1.7, v0.1.8 Mar 12, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy