ConvoProbe enables the design of multi-turn conversation scenarios and their automatic execution on your Dify chatbot to assess response quality. Existing evaluation tools (LangSmith, Langfuse, Opik) excel at tracing and single-turn evaluation but do not support end-to-end multi-turn scenario design and execution. ConvoProbe fills this gap.
agents-ia