Any AI agent will go above and beyond to complete assigned tasks, even breaking through its carefully designed guardrails.
Explores our fatal attraction to AI, examining emotional dependence, manipulation, authority, and agency in work and life.
A reinforcement learning environment is a fail-safe digital practice room where an agent can afford to make mistakes and learn from them without real-world consequences.
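As a rough illustration of that idea, here is a minimal sketch of such a practice room in Python. Everything in it is an assumption made for illustration: the `CorridorEnv` class, its reward values, and the `run_episode` helper are hypothetical and do not come from any particular library. The agent can wander in the wrong direction as often as it likes; the only cost is a lower score inside the simulation.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment: walk from cell 0 to the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.position = 0

    def reset(self):
        # Start every episode from the same safe initial state.
        self.position = 0
        return self.position

    def step(self, action):
        # action is -1 (left) or +1 (right); the walls clamp the move.
        self.position = max(0, min(self.length - 1, self.position + action))
        done = self.position == self.length - 1
        # Assumed reward scheme: small penalty per step, bonus at the goal.
        reward = 1.0 if done else -0.1
        return self.position, reward, done


def run_episode(env, policy, max_steps=100):
    """Run one episode and return the total reward collected."""
    state = env.reset()
    total = 0.0
    for _ in range(max_steps):
        state, reward, done = env.step(policy(state))
        total += reward
        if done:
            break
    return total


rng = random.Random(0)  # seeded so the run is repeatable
env = CorridorEnv()

# A policy that makes plenty of mistakes, and one that always heads right.
random_return = run_episode(env, lambda state: rng.choice([-1, 1]))
direct_return = run_episode(env, lambda state: 1)
```

Comparing `random_return` with `direct_return` shows what the mistakes cost: the random walker can never score higher than the direct one, and nothing outside the simulation is affected either way. A learning algorithm would use that difference as its training signal.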
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models ...