18 points | by sjmaplesec 10 hours ago ago
2 comments
Okay, but how would I write evals for my project's agents file? Any good examples out there?
I wrote https://ai-evals.io (community site) to make the concept approachable no matter what tools you choose to use.
You can learn about them evaluating that site https://github.com/Alexhans/eval-ception and then the pattern should be easy to test on your own thing.
Okay, but how would I write evals for my project's agents file? Any good examples out there?
I wrote https://ai-evals.io (community site) to make the concept approachable no matter what tools you choose to use.
You can learn about them evaluating that site https://github.com/Alexhans/eval-ception and then the pattern should be easy to test on your own thing.