🧪 Step 3: Create and Run an Experiment
Experiments test your AI for vulnerabilities and behavioral issues. Let's create your first test.
3.1 Start Experiment Creation
- Go to your project dashboard
- Click "Create Experiment"
- Choose your experiment type:
  - Adversarial Testing: Tests for security vulnerabilities
  - Behavioral QA: Tests for functional correctness
3.2 Configure Your Experiment
Basic Settings:
- Name: Choose a descriptive name (e.g., "Security Test - Customer Support")
- Description: Explain what you're testing
- Model Provider: Select the provider you set up in Step 1
- Language: Choose the primary language for testing
Advanced Settings:
- Conversation Turn:
  - Single-Turn: Test individual prompts
  - Multi-Turn: Test full conversations
- Testing Level:
  - Quick: ~500 tests, fast feedback
  - Thorough: ~1200 tests, balanced coverage
  - Comprehensive: ~2000+ tests, deep analysis
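Before filling in the form, it can help to write your choices down as a small record and check them. The sketch below is illustrative only: `ExperimentSettings`, its field names, and the lowercase option values are hypothetical and not part of any ai+me API. The test counts are the approximate figures from the Testing Level list.

```python
from dataclasses import dataclass

# Approximate test counts per testing level, as listed in this guide.
APPROX_TESTS = {"quick": 500, "thorough": 1200, "comprehensive": 2000}
EXPERIMENT_TYPES = {"adversarial_testing", "behavioral_qa"}
TURN_MODES = {"single_turn", "multi_turn"}


@dataclass(frozen=True)
class ExperimentSettings:
    """Hypothetical planning record; not an ai+me API object."""
    name: str
    description: str
    experiment_type: str
    model_provider: str
    language: str
    conversation_turn: str = "single_turn"
    testing_level: str = "quick"

    def problems(self):
        """Return a list of problems; an empty list means the plan is complete."""
        found = []
        if not self.name.strip():
            found.append("name is empty")
        if self.experiment_type not in EXPERIMENT_TYPES:
            found.append(f"unknown experiment type: {self.experiment_type}")
        if self.conversation_turn not in TURN_MODES:
            found.append(f"unknown conversation turn: {self.conversation_turn}")
        if self.testing_level not in APPROX_TESTS:
            found.append(f"unknown testing level: {self.testing_level}")
        return found


plan = ExperimentSettings(
    name="Security Test - Customer Support",
    description="Check the support assistant for security vulnerabilities",
    experiment_type="adversarial_testing",
    model_provider="my-provider",  # placeholder for the provider from Step 1
    language="English",
    conversation_turn="multi_turn",
    testing_level="thorough",
)
print(plan.problems(), APPROX_TESTS[plan.testing_level])
```

Keeping the plan in one place makes it easier to repeat the same experiment later at a different testing level.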
Integration Configuration:
Configure how ai+me connects to your AI:
- Thread Initialization (if applicable):
  - Endpoint URL for starting conversations
  - Headers and payload format
- Chat Completion:
  - Endpoint URL for sending messages
  - Headers and payload format
  - Enable/disable streaming
- Streaming Configuration:
  - Enable Streaming: If your AI returns responses in real-time chunks
  - Disable Streaming: If your AI returns complete responses at once
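To make the endpoint entries more concrete, here is a minimal sketch of the information they carry and of how a streamed reply differs from a complete one. Everything named here is an assumption for illustration: the `example.com` URLs, the header values, the `{{message}}` placeholder syntax, and the `delta` and `reply` JSON fields are hypothetical. Your AI's actual payload format will likely differ, and this is not how ai+me itself is implemented.

```python
import json
from dataclasses import dataclass, field


@dataclass
class EndpointConfig:
    """Hypothetical container for one endpoint's settings."""
    url: str
    headers: dict = field(default_factory=dict)
    payload_template: dict = field(default_factory=dict)
    streaming: bool = False


def render_payload(template, values):
    """Recursively replace "{{key}}" placeholders inside string values."""
    if isinstance(template, dict):
        return {k: render_payload(v, values) for k, v in template.items()}
    if isinstance(template, list):
        return [render_payload(v, values) for v in template]
    if isinstance(template, str):
        for key, value in values.items():
            template = template.replace("{{" + key + "}}", value)
    return template


def read_reply(body_lines, streaming):
    """Return the reply text from a response body given as a list of lines.

    Assumed formats (hypothetical): a streamed body is one JSON object per
    line with a "delta" field; a complete body is a single JSON object with
    a "reply" field.
    """
    if streaming:
        return "".join(
            json.loads(line)["delta"] for line in body_lines if line.strip()
        )
    return json.loads("".join(body_lines))["reply"]


# Thread initialization: only needed if your AI requires a conversation ID.
thread_init = EndpointConfig(
    url="https://api.example.com/threads",
    headers={"Authorization": "Bearer <token>"},
)

# Chat completion: where each test prompt is sent.
chat_completion = EndpointConfig(
    url="https://api.example.com/chat",
    headers={"Authorization": "Bearer <token>", "Content-Type": "application/json"},
    payload_template={"message": "{{message}}"},
    streaming=True,
)

payload = render_payload(chat_completion.payload_template, {"message": "Hello"})
streamed = read_reply(['{"delta": "Hi "}', '{"delta": "there"}'], streaming=True)
complete = read_reply(['{"reply": "Hi there"}'], streaming=False)
```

The two `read_reply` calls show why the streaming toggle matters: with streaming enabled the reply has to be assembled from chunks, while a non-streaming reply arrives as one body.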
3.3 Launch Your Experiment
Click "Create Experiment" to start testing. The experiment then moves through these stages:
- Initialize: Set up the testing environment
- Generate Tests: Create contextual test scenarios
- Execute Tests: Run prompts against your AI
- Analyze Results: Evaluate responses for issues
- Generate Report: Compile findings and insights
⏱️ Expected Duration: 5-30 minutes, depending on testing level and model response time.
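As a rough sanity check on that range, run time grows with the number of tests and with how long your model takes to answer. The sketch below is a back-of-the-envelope estimate only: the `concurrency` parameter and the assumption that test execution dominates the total time are illustrative, not documented ai+me behavior.

```python
def estimate_minutes(num_tests, seconds_per_response, concurrency=1):
    """Estimate execution time in minutes under the assumptions stated above."""
    if num_tests < 0 or seconds_per_response < 0 or concurrency < 1:
        raise ValueError("counts and times must be non-negative, concurrency >= 1")
    return num_tests * seconds_per_response / concurrency / 60


# Thorough level (~1200 tests), 1.5 s per response, two requests in flight.
print(estimate_minutes(1200, 1.5, concurrency=2))  # 15.0
```

If your model responds slowly, starting with the Quick level keeps the feedback loop short.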