flawed-apps

Here are 2 public repositories matching this topic...

A micro-benchmark suite to assess the effectiveness of tools designed for IoT apps

Evaluate safety in long-horizon, tool-using AI agents with this collection of realistic trajectory benchmarks.

redux javascript graphql typescript text blockchain iot-platform artemis video-generation malicious-behaviors image-to-video flawed-apps defi a2a-mcp a2a-server

Add a description, image, and links to the flawed-apps topic page so that developers can more easily learn about it.

To associate your repository with the flawed-apps topic, visit your repo's landing page and select "manage topics."