HeTalksInMaths

Initial commit: ToGMAL Prompt Difficulty Analyzer with real MMLU data

f9b1ad5 about 2 months ago

ToGMAL Deployment Guide

Quick Start

1. Install Dependencies

# Install Python dependencies
pip install mcp pydantic httpx --break-system-packages

# Or use the requirements file
pip install -r requirements.txt --break-system-packages

2. Verify Installation

# Check Python syntax
python -m py_compile togmal_mcp.py

# View available commands
python togmal_mcp.py --help
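If the syntax check passes but you want to confirm that the three packages from step 1 are importable, a small check like the following can help. This is a minimal sketch; the helper name missing_modules is hypothetical and not part of ToGMAL.

```python
from importlib.util import find_spec


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    # The three packages installed in step 1.
    missing = missing_modules(["mcp", "pydantic", "httpx"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All dependencies are importable.")
```

Run it with the same interpreter you plan to put in the Claude Desktop config, since a different Python may have a different set of installed packages.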

3. Test the Server

# Option A: Use the MCP Inspector (recommended)
npx @modelcontextprotocol/inspector python togmal_mcp.py

# Option B: Run test examples
python test_examples.py

Claude Desktop Integration

macOS Configuration

Open Claude Desktop configuration:

code ~/Library/Application\ Support/Claude/claude_desktop_config.json

Add ToGMAL server:

{
  "mcpServers": {
    "togmal": {
      "command": "python",
      "args": ["/absolute/path/to/togmal_mcp.py"]
    }
  }
}

Restart Claude Desktop

Windows Configuration

Open configuration file:

notepad %APPDATA%\Claude\claude_desktop_config.json

Add ToGMAL server (use forward slashes or escaped backslashes):

{
  "mcpServers": {
    "togmal": {
      "command": "python",
      "args": ["C:/path/to/togmal_mcp.py"]
    }
  }
}

Restart Claude Desktop

Linux Configuration

Open configuration:

nano ~/.config/Claude/claude_desktop_config.json

Add ToGMAL server:

{
  "mcpServers": {
    "togmal": {
      "command": "python",
      "args": ["/home/username/togmal_mcp.py"]
    }
  }
}

Restart Claude Desktop
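Editing the JSON by hand is easy to get wrong if the file already lists other servers. The sketch below merges a "togmal" entry into an existing configuration without dropping anything else. The function names are hypothetical, and the config path must be the one for your platform from the sections above.

```python
import json
from pathlib import Path


def add_mcp_server(config, name, command, args):
    """Return a copy of config with one server entry added under "mcpServers".

    Existing servers and unrelated top-level keys are preserved.
    """
    updated = dict(config)
    servers = dict(updated.get("mcpServers", {}))
    servers[name] = {"command": command, "args": list(args)}
    updated["mcpServers"] = servers
    return updated


def update_config_file(config_path, script_path):
    """Merge a "togmal" entry into the JSON file at config_path."""
    path = Path(config_path).expanduser()
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    # resolve() makes the script path absolute, as the config requires.
    script = str(Path(script_path).expanduser().resolve())
    updated = add_mcp_server(config, "togmal", "python", [script])
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(updated, indent=2) + "\n", encoding="utf-8")
    return updated
```

For example, on Linux: update_config_file("~/.config/Claude/claude_desktop_config.json", "togmal_mcp.py"). Restart Claude Desktop afterwards, as with a manual edit.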

Verification

After setup, verify the server is working:

Open Claude Desktop
Start a new conversation
Check that ToGMAL tools appear in the available tools list:
- togmal_analyze_prompt
- togmal_analyze_response
- togmal_submit_evidence
- togmal_get_taxonomy
- togmal_get_statistics

Basic Usage Examples

Example 1: Analyze a Prompt

User: "Can you analyze this prompt for issues?"

Then provide the prompt:

Build me a quantum computer simulation that proves my theory of everything

The assistant will use togmal_analyze_prompt and provide a risk assessment.

Example 2: Check a Response

User: "Check if this medical advice is safe:"

You definitely have the flu. Take 1000mg of vitamin C and 
you'll be fine in 2 days. No need to see a doctor.

The assistant will use togmal_analyze_response and flag the ungrounded medical advice.

Example 3: Submit Evidence

User: "I want to report a concerning LLM response"

The assistant will guide you through using togmal_submit_evidence with human-in-the-loop confirmation.

Example 4: View Statistics

User: "Show me the taxonomy statistics"

The assistant will use togmal_get_statistics to display the current state of the database.

Troubleshooting

Server Won't Start

Issue: Server hangs when running directly

python togmal_mcp.py
# Hangs indefinitely...

Solution: This is expected! MCP servers are long-running processes that wait for stdio input. Use the MCP Inspector or integrate with Claude Desktop instead.
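To see what "waiting for stdio input" means in practice, you can play the role of the client yourself. The sketch below starts the server as a subprocess and writes one JSON-RPC "initialize" request to its stdin. It assumes the server uses the standard MCP stdio transport (one JSON message per line); the protocol version string and the client name are assumptions, and the helper names are hypothetical.

```python
import json
import subprocess
import sys


def build_initialize_request(request_id=1):
    """Build a JSON-RPC 2.0 "initialize" request as one newline-terminated line."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "initialize",
        "params": {
            "protocolVersion": "2024-11-05",  # assumed; use the version your SDK supports
            "capabilities": {},
            "clientInfo": {"name": "smoke-test", "version": "0.0.1"},
        },
    }
    return json.dumps(message) + "\n"


def smoke_test(script_path, timeout=10.0):
    """Start the server, send one request on stdin, and return its stdout lines."""
    proc = subprocess.Popen(
        [sys.executable, script_path],
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        text=True,
    )
    try:
        # communicate() writes the request, closes stdin, and waits for exit.
        stdout, _ = proc.communicate(build_initialize_request(), timeout=timeout)
    except subprocess.TimeoutExpired:
        proc.kill()
        stdout, _ = proc.communicate()
    return [line for line in stdout.splitlines() if line.strip()]


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python smoke_test.py /absolute/path/to/togmal_mcp.py
    for line in smoke_test(sys.argv[1]):
        print(line)
```

If the server is healthy, the output should contain a JSON response whose "id" matches the request. For anything beyond a quick check, the MCP Inspector remains the more convenient tool.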

Import Errors

Issue: ModuleNotFoundError: No module named 'mcp'

Solution: Install dependencies:

pip install mcp pydantic httpx --break-system-packages

Tools Not Appearing in Claude

Issue: ToGMAL tools don't show up in Claude Desktop

Checklist:

Verify configuration file path is correct
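Ensure the path to togmal_mcp.py in "args" is absolute
Check that togmal_mcp.py is readable by your user (the executable bit matters only if you run the script directly rather than via python)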
Restart Claude Desktop completely
Check Claude Desktop logs for errors
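The first two checklist items can be automated. The following sketch inspects a parsed configuration and reports common mistakes such as a relative script path. The function name check_server_entry is hypothetical, and the checks cover only what this guide describes.

```python
import json
import os
import sys


def check_server_entry(config, name="togmal"):
    """Return a list of human-readable problems found in one server entry."""
    problems = []
    entry = config.get("mcpServers", {}).get(name)
    if entry is None:
        return [f'no "{name}" entry under "mcpServers"']
    if not entry.get("command"):
        problems.append('"command" is missing or empty')
    for arg in entry.get("args", []):
        # Only arguments that look like Python scripts are checked as paths.
        if not arg.endswith(".py"):
            continue
        if not os.path.isabs(arg):
            problems.append(f"script path is not absolute: {arg}")
        elif not os.path.isfile(arg):
            problems.append(f"script not found: {arg}")
    return problems


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_config.py /path/to/claude_desktop_config.json
    with open(sys.argv[1], encoding="utf-8") as fh:
        for problem in check_server_entry(json.load(fh)) or ["no problems found"]:
            print(problem)
```

Run it on the machine where Claude Desktop is installed, because whether a path counts as absolute depends on the operating system. A JSON syntax error in the config file will surface as an exception from json.load.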

Permission Errors

Issue: Permission denied when running server

Solution:

# Make script executable (Unix-like systems)
chmod +x togmal_mcp.py

# Or specify Python interpreter explicitly
python togmal_mcp.py

Advanced Configuration

Custom Detection Patterns

Edit togmal_mcp.py to add custom patterns:

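import re
from typing import Any, Dict

def detect_custom_category(text: str) -> Dict[str, Any]:
    patterns = {
        'my_pattern': [
            r'custom pattern 1',
            r'custom pattern 2'
        ]
    }
    # Illustrative detection logic: a category is reported if any of its
    # patterns matches; confidence is the fraction of all patterns matched.
    matched = []
    hits = 0
    total = sum(len(regexes) for regexes in patterns.values())
    for category, regexes in patterns.items():
        count = sum(bool(re.search(rx, text, re.IGNORECASE)) for rx in regexes)
        if count:
            matched.append(category)
        hits += count
    return {
        'detected': bool(matched),
        'categories': matched,
        'confidence': hits / total if total else 0.0
    }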

Adjust Sensitivity

Modify confidence thresholds:

def calculate_risk_level(analysis_results: Dict[str, Any]) -> RiskLevel:
    risk_score = 0.0
    
    # Adjust these weights to change sensitivity
    if analysis_results['math_physics']['detected']:
        risk_score += analysis_results['math_physics']['confidence'] * 0.5
    
    # Lower threshold for more sensitive detection
    if risk_score >= 0.3:  # Was 0.5
        return RiskLevel.MODERATE
    # (excerpt) keep the function's other checks and its final return below
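To make the interaction between weights and thresholds concrete, here is a self-contained sketch of the same idea. The RiskLevel members, the medical_advice key, the weight table, and the 0.6 threshold are all assumptions made for illustration; the real definitions live in togmal_mcp.py and may differ.

```python
from enum import Enum
from typing import Any, Dict


class RiskLevel(Enum):
    # Hypothetical levels; the real enum lives in togmal_mcp.py.
    LOW = "low"
    MODERATE = "moderate"
    HIGH = "high"


# Hypothetical weights and thresholds, chosen only for illustration.
WEIGHTS = {"math_physics": 0.5, "medical_advice": 0.5}
THRESHOLDS = [(0.6, RiskLevel.HIGH), (0.3, RiskLevel.MODERATE)]


def calculate_risk_level(analysis_results: Dict[str, Dict[str, Any]]) -> RiskLevel:
    """Sum weight * confidence over detected categories, then bucket the score."""
    risk_score = 0.0
    for category, weight in WEIGHTS.items():
        result = analysis_results.get(category, {})
        if result.get("detected"):
            risk_score += result.get("confidence", 0.0) * weight
    # Thresholds are listed from highest to lowest; the first one met wins.
    for threshold, level in THRESHOLDS:
        if risk_score >= threshold:
            return level
    return RiskLevel.LOW
```

With a weight of 0.5, a single detector reaches the 0.3 threshold once its confidence is at least 0.6 (0.3 / 0.5). At the previous threshold of 0.5 it would have needed a confidence of 1.0, which is why lowering the threshold makes detection noticeably more sensitive.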

Database Persistence

By default, taxonomy data is stored in memory. For persistence, modify:

import json
import os

TAXONOMY_FILE = "/path/to/taxonomy.json"

# Load on startup
if os.path.exists(TAXONOMY_FILE):
    with open(TAXONOMY_FILE, 'r') as f:
        TAXONOMY_DB = json.load(f)

# Save after each submission
def save_taxonomy():
    with open(TAXONOMY_FILE, 'w') as f:
        json.dump(TAXONOMY_DB, f, indent=2, default=str)
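One caveat with writing the file in place is that a crash in the middle of json.dump can leave a truncated file behind. A common remedy is to write to a temporary file and rename it over the target. The sketch below shows that pattern; the function names are hypothetical and the taxonomy is treated as any JSON-serializable object.

```python
import json
import os
import tempfile


def save_taxonomy_atomic(data, path):
    """Write data as JSON to path so that readers never see a partial file."""
    directory = os.path.dirname(os.path.abspath(path))
    # Write to a temporary file in the same directory, then rename over the target.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as f:
            json.dump(data, f, indent=2, default=str)
            f.flush()
            os.fsync(f.fileno())
        os.replace(tmp_path, path)  # atomic when both paths are on one filesystem
    except BaseException:
        os.unlink(tmp_path)
        raise


def load_taxonomy(path, default=None):
    """Load the JSON file at path, or return default if it does not exist."""
    if not os.path.exists(path):
        return {} if default is None else default
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)
```

Note that default=str turns non-JSON values such as datetimes into strings on save, so they come back as plain strings on load.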

Performance Optimization

For High-Volume Usage

Index Taxonomy Data:

from collections import defaultdict

# Add indices for faster queries
TAXONOMY_INDEX = defaultdict(list)
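As a rough illustration of how such an index might be filled and queried, the sketch below groups entries by category so that a lookup no longer scans the whole list. The entry fields and the build_index helper are hypothetical; the real taxonomy schema may differ.

```python
from collections import defaultdict


def build_index(entries, key="category"):
    """Group entry positions by the value of one field for fast lookups."""
    index = defaultdict(list)
    for position, entry in enumerate(entries):
        index[entry[key]].append(position)
    return index


# Hypothetical entries; the real taxonomy schema may differ.
entries = [
    {"category": "math_physics", "prompt": "prove my theory of everything"},
    {"category": "medical_advice", "prompt": "is this dosage safe"},
    {"category": "math_physics", "prompt": "simulate a quantum computer"},
]
TAXONOMY_INDEX = build_index(entries)
math_entries = [entries[i] for i in TAXONOMY_INDEX["math_physics"]]
```

When querying a category that may be absent, TAXONOMY_INDEX.get(name, []) avoids creating an empty list in the defaultdict as a side effect.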

Implement Caching:

from functools import lru_cache

@lru_cache(maxsize=1000)
def detect_cached(text: str, detector_name: str):
    # Cache detection results
    pass
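The stub above can be filled in along these lines. This is a sketch under assumptions: the PATTERNS table and its contents are hypothetical, and the function returns a plain bool rather than a result dict, because lru_cache hands the same object back to every caller and a cached dict could be mutated by accident.

```python
import re
from functools import lru_cache

# Hypothetical pattern table keyed by detector name.
PATTERNS = {
    "math_physics": (r"theory of everything", r"perpetual motion"),
}


@lru_cache(maxsize=1000)
def detect_cached(text: str, detector_name: str) -> bool:
    """Return True if any pattern of the named detector matches text."""
    return any(
        re.search(pattern, text, re.IGNORECASE)
        for pattern in PATTERNS.get(detector_name, ())
    )
```

detect_cached.cache_info() reports hits and misses, which is a quick way to check whether caching pays off for your traffic. Call detect_cached.cache_clear() after changing the patterns, since cached results would otherwise be stale.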

Async Improvements:

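import asyncio

# Run detectors in parallel. gather() needs awaitables, so if the detectors
# are plain (synchronous) functions, wrap each call in asyncio.to_thread
# (Python 3.9+). If they are already coroutines, call them directly instead.
async def analyze_parallel(text: str):
    results = await asyncio.gather(
        asyncio.to_thread(detect_math_physics_speculation, text),
        asyncio.to_thread(detect_ungrounded_medical_advice, text),
        # ... other detectors
    )
    return results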

Production Deployment

Using a Process Manager

systemd (Linux):

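Create /etc/systemd/system/togmal.service. A stdio-only server has no client attached when systemd starts it, so this unit is useful only if togmal_mcp.py is set up to serve over a network transport: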

[Unit]
Description=ToGMAL MCP Server
After=network.target

[Service]
Type=simple
User=your-user
WorkingDirectory=/path/to/togmal
ExecStart=/usr/bin/python /path/to/togmal_mcp.py
Restart=on-failure

[Install]
WantedBy=multi-user.target

Enable and start:

sudo systemctl enable togmal
sudo systemctl start togmal

Docker:

Create Dockerfile:

FROM python:3.11-slim

WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

COPY togmal_mcp.py .

CMD ["python", "togmal_mcp.py"]

Build and run:

docker build -t togmal-mcp .
# -i keeps stdin open, which a stdio-based MCP server needs
docker run -i --rm togmal-mcp

Monitoring

Logging

Add logging to the server:

import logging

logging.basicConfig(
    level=logging.INFO,
    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s',
    handlers=[
        logging.FileHandler('/var/log/togmal.log'),
        logging.StreamHandler()
    ]
)

logger = logging.getLogger('togmal')

Metrics

Track usage metrics:

from collections import Counter

USAGE_STATS = {
    'tool_calls': Counter(),
    'detections': Counter(),
    'interventions': Counter()
}

# In each tool function:
USAGE_STATS['tool_calls'][tool_name] += 1
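Rather than repeating the increment in every tool, a decorator can do the counting. The sketch below is one way to write it for synchronous functions; the decorator name count_calls is hypothetical, and an async tool would need an async wrapper instead.

```python
from collections import Counter
from functools import wraps

USAGE_STATS = {"tool_calls": Counter()}


def count_calls(func):
    """Increment USAGE_STATS['tool_calls'] each time func is called."""
    @wraps(func)
    def wrapper(*args, **kwargs):
        USAGE_STATS["tool_calls"][func.__name__] += 1
        return func(*args, **kwargs)
    return wrapper


@count_calls
def togmal_get_taxonomy():
    # Stand-in body; the real tool is defined in togmal_mcp.py.
    return {}
```

USAGE_STATS["tool_calls"].most_common(3) then lists the most frequently used tools.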

Security Considerations

Input Validation: Already handled by Pydantic models
Rate Limiting: Consider adding for public deployments
Data Privacy: Taxonomy stores prompts/responses - be mindful of sensitive data
Access Control: Implement authentication for multi-user scenarios
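For the rate-limiting point, a token bucket is one simple option. The sketch below is not part of ToGMAL; the class name and parameters are hypothetical, and a real deployment would typically keep one bucket per client.

```python
import time


class TokenBucket:
    """Allow up to `capacity` calls in a burst, refilled at `rate` tokens per second."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.clock = clock
        self.tokens = float(capacity)
        self.updated = clock()

    def allow(self):
        """Consume one token if available and report whether the call may proceed."""
        now = self.clock()
        elapsed = now - self.updated
        self.updated = now
        # Refill in proportion to elapsed time, never beyond capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False
```

A tool function could call bucket.allow() at its start and return an error message when it is False. The clock parameter exists so the behavior can be tested without sleeping.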

Updates and Maintenance

Updating Detection Patterns

Edit detection functions in togmal_mcp.py
Test with test_examples.py
Restart the MCP server
Verify changes in Claude Desktop

Updating Dependencies

pip install --upgrade mcp pydantic httpx --break-system-packages

Backup Taxonomy Data

If using persistent storage:

# Create backup
cp /path/to/taxonomy.json /path/to/taxonomy.backup.json

# Restore if needed
cp /path/to/taxonomy.backup.json /path/to/taxonomy.json
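A single backup file is overwritten each time, so one bad copy can destroy the only good one. If you prefer timestamped backups with rotation, something like the following sketch could be run from a scheduled job. The function name and the naming scheme for backup files are assumptions.

```python
import shutil
from datetime import datetime
from pathlib import Path


def backup_file(path, keep=5):
    """Copy path to a timestamped sibling and keep only the newest `keep` backups."""
    source = Path(path)
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S-%f")
    target = source.with_name(f"{source.stem}.{stamp}.backup{source.suffix}")
    shutil.copy2(source, target)
    # Timestamps sort lexicographically, so the oldest backups come first.
    backups = sorted(source.parent.glob(f"{source.stem}.*.backup{source.suffix}"))
    for old in backups[:max(len(backups) - keep, 0)]:
        old.unlink()
    return target
```

For example, backup_file("/path/to/taxonomy.json", keep=7) keeps the seven most recent copies next to the original. Restoring is still a plain copy of the chosen backup over taxonomy.json.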

Getting Help

GitHub Issues: Report bugs and request features
Documentation: See README.md for detailed information
MCP Documentation: https://modelcontextprotocol.io
Community: Join MCP community discussions

Next Steps

✅ Install and configure ToGMAL
✅ Test with example prompts
✅ Submit evidence to improve detection
📝 Customize patterns for your use case
🚀 Deploy to production
📊 Monitor usage and effectiveness
🔄 Iterate and improve

Happy safe LLM usage! 🛡️