phoenix2pytest
Turn a production LLM failure trace into a runnable pytest regression test.
An example is pre-filled below: just click Generate. Generation calls Gemini and usually takes a few seconds.
Turn a production LLM failure trace into a runnable pytest regression test.
An example is pre-filled below: just click Generate. Generation calls Gemini and usually takes a few seconds.