DevExplore
  • Categories
  • Tools Directory
  • AI Stack Builder
  • Resources
  • Jobs
  • Advertise
AboutContactSign in
Home/Tools Directory/Giskard
DevExplore

The discovery platform for developers

Platform

  • Categories
  • Tools Directory
  • AI Stack Builder
  • Resources
  • Jobs
  • Advertise

Community

  • Create account
  • Sign in
  • Submit a tool
  • Browse jobs

Company

  • About Us
  • Contact Us
  • Privacy Policy
  • Terms of Service
  • Cookie Policy

Get Updates

Occasional product updates and curated picks. No spam.

    © 2026 DevExplore. All rights reserved.

    About UsContact UsPrivacy PolicyTerms of ServiceCookie Policy
    1. Home
    2. /
    3. Tools Directory
    4. /
    5. Giskard
    G

    Added 5/29/2026

    Giskard

    Scan AI agents for vulnerabilities before and after deployment

    TestingEvaluationGuardrailsOpen Source
    Visit WebsiteGitHub

    Description

    Giskard is an open-source Python evaluation and red teaming framework built by Alex Combessie and Jean-Marie John-Mathews, headquartered in Paris. Combessie's prior role at Dataiku exposed a consistent gap in enterprise AI workflows: testing was fragmented, hard to compare across vendors, and unprepared for production edge cases. Giskard targets that gap with automated vulnerability scanning that generates domain-specific test cases from your own knowledge base, rather than applying generic probes. As the only major LLM testing platform built by a European entity, Giskard is purpose-built for EU AI Act compliance and data-residency requirements that US-based tools address only partially.

    Key Capabilities

    • Automated LLM scan: A single giskard.scan() call detects hallucinations, prompt injection, sensitive information disclosure, stereotypes, and harmful content across your LLM agent without manual test case authoring

    • RAGET (RAG Evaluation Toolkit): Generates realistic synthetic test cases directly from a RAG knowledge base to evaluate answer correctness, groundedness, and retrieval quality across pipeline components

    • EU AI Act and OWASP LLM Top 10 compliance packs: Pre-built compliance presets activate full vulnerability suites aligned to European regulatory requirements and OWASP LLM Top 10 categories from a single config entry

    • Black-box testing via API endpoint: Giskard tests any accessible API without requiring access to internal model architecture, vector databases, or source code, making it usable against third-party or vendor-hosted AI systems

    • Giskard Guards (guardrail platform): An on-premise guardrail layer with a Policy-as-Code framework that secures the full agent execution chain for regulated industries requiring EU-sovereign data processing

    • Continuous red teaming (Hub): Giskard Hub generates new adversarial attack scenarios automatically as threat landscapes evolve, with RBAC, audit trails, team collaboration, and GDPR-native data handling built into the enterprise tier

    See Giskard pricing details →

    Alternative tools

    • Claude Code

      Agentic coding tool that runs in your terminal

    • Patronus AI

      Score, benchmark, and stress-test LLM outputs for enterprise deployments

    • Harness

      AI-powered software delivery platform for the post-code lifecycle.

    • Spacelift

      IaC orchestration platform for Terraform, OpenTofu, and Pulumi teams.

    • Kiro

      AWS spec-driven AI IDE with GovCloud certification

    • CodeRabbit

      AI code review platform for pull requests and agent output

    Used in Stacks

    No saved stacks include this tool yet.

    Browse more in Testing