Spaces:

JatinAutonomousLabs
/

Research_AI_Assistant

Sleeping

App Files Files Community

JatsTheAIGen commited on Oct 29

Commit

e440f24

1 Parent(s): 1ca61f9

Log Process Flow as JSON for review v2

Browse files

Files changed (5) hide show

LLM_LOGGING_ENHANCEMENT.md +186 -0
llm_router.py +62 -0
src/agents/synthesis_agent.py +30 -0
src/llm_router.py +62 -0
test_logging.py +116 -0

LLM_LOGGING_ENHANCEMENT.md ADDED Viewed

	@@ -0,0 +1,186 @@

+# LLM API Response Logging Enhancement
+## Overview
+This document describes the comprehensive logging enhancements implemented to ensure all LLM API inference responses are printed without truncation in container logs.
+## Changes Made
+### 1. Enhanced LLM Router Logging (`src/llm_router.py` and `llm_router.py`)
+#### Complete API Request Logging
+- **Full Prompt Content**: Logs the complete prompt sent to the LLM without truncation
+- **API Request Details**: Logs all request parameters including:
+  - API URL
+  - Model ID
+  - Task Type
+  - Max Tokens
+  - Temperature
+  - Top P
+  - User Message Length
+- **Complete API Payload**: Logs the full JSON payload sent to the API
+#### Complete API Response Logging
+- **Response Metadata**: Logs complete API response metadata including:
+  - Status Code
+  - Response Headers
+  - Response Size
+  - Complete API Response JSON
+- **Full Response Content**: Logs the complete LLM-generated response without any truncation
+- **Structured Format**: Uses clear delimiters and formatting for easy identification in logs
+### 2. Enhanced Synthesis Agent Logging (`src/agents/synthesis_agent.py`)
+#### Response Synthesis Logging
+- **Complete LLM Response**: Logs the full response received from LLM for synthesis
+- **Agent Context**: Includes agent ID, task type, and response length
+- **Structured Format**: Uses clear delimiters for easy log parsing
+#### Narrative Summary Logging
+- **Complete Summary Response**: Logs the full narrative summary generated by LLM
+- **Context Information**: Includes interaction count and summary length
+- **Structured Format**: Uses clear delimiters for easy identification
+## Log Format Examples
+### API Request Logging
+```
+================================================================================
+LLM API REQUEST - COMPLETE PROMPT:
+================================================================================
+Model: mistralai/Mistral-7B-Instruct-v0.2
+Task Type: response_synthesis
+Prompt Length: 1234 characters
+----------------------------------------
+FULL PROMPT CONTENT:
+----------------------------------------
+[Complete prompt content here]
+----------------------------------------
+END OF PROMPT
+================================================================================
+```
+### API Response Logging
+```
+================================================================================
+LLM API RESPONSE METADATA:
+================================================================================
+Status Code: 200
+Response Headers: {'content-type': 'application/json', ...}
+Response Size: 5678 characters
+----------------------------------------
+COMPLETE API RESPONSE JSON:
+----------------------------------------
+{
+  "choices": [
+    {
+      "message": {
+        "content": "[Full response content]"
+      }
+    }
+  ]
+}
+----------------------------------------
+END OF API RESPONSE METADATA
+================================================================================
+```
+### Complete Response Content Logging
+```
+================================================================================
+COMPLETE LLM API RESPONSE:
+================================================================================
+Model: mistralai/Mistral-7B-Instruct-v0.2
+Task Type: response_synthesis
+Response Length: 1234 characters
+----------------------------------------
+FULL RESPONSE CONTENT:
+----------------------------------------
+[Complete LLM response without truncation]
+----------------------------------------
+END OF LLM RESPONSE
+================================================================================
+```
+## Container Log Benefits
+### 1. Complete Visibility
+- **No Truncation**: All LLM responses are logged in full
+- **Request/Response Pairs**: Complete visibility into API interactions
+- **Metadata Included**: Full context for debugging and monitoring
+### 2. Easy Parsing
+- **Clear Delimiters**: Uses consistent formatting with `=` and `-` characters
+- **Structured Sections**: Each log section is clearly marked
+- **Searchable Format**: Easy to grep for specific log sections
+### 3. Debugging Support
+- **Full Context**: Complete prompts and responses for debugging
+- **Error Tracking**: Complete API response metadata for error analysis
+- **Performance Monitoring**: Response sizes and timing information
+## Usage in Container Environments
+### Docker Logs
+```bash
+# View all logs
+docker logs <container_id>
+# Follow logs in real-time
+docker logs -f <container_id>
+# Search for LLM responses
+docker logs <container_id> | grep "COMPLETE LLM API RESPONSE"
+```
+### Kubernetes Logs
+```bash
+# View pod logs
+kubectl logs <pod-name>
+# Follow logs
+kubectl logs -f <pod-name>
+# Search for specific log sections
+kubectl logs <pod-name> | grep "FULL RESPONSE CONTENT"
+```
+## Files Modified
+1. **`Research_AI_Assistant/src/llm_router.py`**
+   - Added comprehensive API request logging
+   - Added complete API response metadata logging
+   - Added full response content logging
+2. **`Research_AI_Assistant/llm_router.py`**
+   - Added comprehensive API request logging
+   - Added complete API response metadata logging
+   - Added full response content logging
+3. **`Research_AI_Assistant/src/agents/synthesis_agent.py`**
+   - Added complete LLM response logging for synthesis
+   - Added complete narrative summary logging
+   - Added structured formatting for easy parsing
+4. **`Research_AI_Assistant/test_logging.py`** (New)
+   - Test script to verify logging configuration
+   - Tests both basic logging and LLM-specific logging
+## Verification
+The logging configuration has been tested and verified to:
+- ✅ Log complete prompts without truncation
+- ✅ Log complete API responses without truncation
+- ✅ Include all metadata and context information
+- ✅ Use clear, structured formatting
+- ✅ Work in container environments
+- ✅ Be easily searchable and parseable
+## Benefits
+1. **Complete Transparency**: All LLM interactions are fully visible in logs
+2. **Debugging Support**: Full context for troubleshooting issues
+3. **Performance Monitoring**: Complete response data for analysis
+4. **Container Compatibility**: Works seamlessly in Docker/Kubernetes environments
+5. **No Data Loss**: No truncation means no loss of important information
+This enhancement ensures that all LLM API inference responses are captured in full detail in container logs, providing complete visibility into the system's AI interactions.

llm_router.py CHANGED Viewed

@@ -84,6 +84,19 @@ class LLMRouter:
             logger.info(f"Calling HF Chat Completions API for model: {model_id}")
             logger.debug(f"Prompt length: {len(prompt)}")
             headers = {
                 "Authorization": f"Bearer {self.hf_token}",
@@ -107,11 +120,47 @@ class LLMRouter:
                 "top_p": kwargs.get("top_p", 0.95)
             }
             # Make the API call
             response = requests.post(api_url, json=payload, headers=headers, timeout=60)
             if response.status_code == 200:
                 result = response.json()
                 # Handle chat completions response format
                 if "choices" in result and len(result["choices"]) > 0:
                     message = result["choices"][0].get("message", {})
@@ -123,6 +172,19 @@ class LLMRouter:
                         return None
                     logger.info(f"HF API returned response (length: {len(generated_text)})")
                     return generated_text
                 else:
                     logger.error(f"Unexpected response format: {result}")

             logger.info(f"Calling HF Chat Completions API for model: {model_id}")
             logger.debug(f"Prompt length: {len(prompt)}")
+            logger.info("=" * 80)
+            logger.info("LLM API REQUEST - COMPLETE PROMPT:")
+            logger.info("=" * 80)
+            logger.info(f"Model: {model_id}")
+            logger.info(f"Task Type: {task_type}")
+            logger.info(f"Prompt Length: {len(prompt)} characters")
+            logger.info("-" * 40)
+            logger.info("FULL PROMPT CONTENT:")
+            logger.info("-" * 40)
+            logger.info(prompt)
+            logger.info("-" * 40)
+            logger.info("END OF PROMPT")
+            logger.info("=" * 80)
             headers = {
                 "Authorization": f"Bearer {self.hf_token}",
                 "top_p": kwargs.get("top_p", 0.95)
             }
+            # Log complete API request details
+            logger.info("=" * 80)
+            logger.info("LLM API REQUEST DETAILS:")
+            logger.info("=" * 80)
+            logger.info(f"API URL: {api_url}")
+            logger.info(f"Model: {model_id}")
+            logger.info(f"Task Type: {task_type}")
+            logger.info(f"Max Tokens: {kwargs.get('max_tokens', 2000)}")
+            logger.info(f"Temperature: {kwargs.get('temperature', 0.7)}")
+            logger.info(f"Top P: {kwargs.get('top_p', 0.95)}")
+            logger.info(f"User Message Length: {len(user_message)} characters")
+            logger.info("-" * 40)
+            logger.info("API PAYLOAD:")
+            logger.info("-" * 40)
+            import json
+            logger.info(json.dumps(payload, indent=2))
+            logger.info("-" * 40)
+            logger.info("END OF API REQUEST")
+            logger.info("=" * 80)
             # Make the API call
             response = requests.post(api_url, json=payload, headers=headers, timeout=60)
             if response.status_code == 200:
                 result = response.json()
+                # Log complete API response metadata
+                logger.info("=" * 80)
+                logger.info("LLM API RESPONSE METADATA:")
+                logger.info("=" * 80)
+                logger.info(f"Status Code: {response.status_code}")
+                logger.info(f"Response Headers: {dict(response.headers)}")
+                logger.info(f"Response Size: {len(response.text)} characters")
+                logger.info("-" * 40)
+                logger.info("COMPLETE API RESPONSE JSON:")
+                logger.info("-" * 40)
+                logger.info(json.dumps(result, indent=2))
+                logger.info("-" * 40)
+                logger.info("END OF API RESPONSE METADATA")
+                logger.info("=" * 80)
                 # Handle chat completions response format
                 if "choices" in result and len(result["choices"]) > 0:
                     message = result["choices"][0].get("message", {})
                         return None
                     logger.info(f"HF API returned response (length: {len(generated_text)})")
+                    logger.info("=" * 80)
+                    logger.info("COMPLETE LLM API RESPONSE:")
+                    logger.info("=" * 80)
+                    logger.info(f"Model: {model_id}")
+                    logger.info(f"Task Type: {task_type}")
+                    logger.info(f"Response Length: {len(generated_text)} characters")
+                    logger.info("-" * 40)
+                    logger.info("FULL RESPONSE CONTENT:")
+                    logger.info("-" * 40)
+                    logger.info(generated_text)
+                    logger.info("-" * 40)
+                    logger.info("END OF LLM RESPONSE")
+                    logger.info("=" * 80)
                     return generated_text
                 else:
                     logger.error(f"Unexpected response format: {result}")

src/agents/synthesis_agent.py CHANGED Viewed

@@ -112,6 +112,19 @@ class ResponseSynthesisAgent:
                     # Clean up the response
                     clean_response = llm_response.strip()
                     logger.info(f"{self.agent_id} received LLM response (length: {len(clean_response)})")
                     return {
                         "draft_response": clean_response,
@@ -283,6 +296,23 @@ Summary:"""
                 # Remove any "Summary:" prefix if present
                 if clean_summary.startswith("Summary:"):
                     clean_summary = clean_summary[9:].strip()
                 return clean_summary
         except Exception as e:

                     # Clean up the response
                     clean_response = llm_response.strip()
                     logger.info(f"{self.agent_id} received LLM response (length: {len(clean_response)})")
+                    logger.info("=" * 80)
+                    logger.info("SYNTHESIS AGENT - COMPLETE LLM RESPONSE:")
+                    logger.info("=" * 80)
+                    logger.info(f"Agent: {self.agent_id}")
+                    logger.info(f"Task: response_synthesis")
+                    logger.info(f"Response Length: {len(clean_response)} characters")
+                    logger.info("-" * 40)
+                    logger.info("FULL LLM RESPONSE CONTENT:")
+                    logger.info("-" * 40)
+                    logger.info(clean_response)
+                    logger.info("-" * 40)
+                    logger.info("END OF SYNTHESIS LLM RESPONSE")
+                    logger.info("=" * 80)
                     return {
                         "draft_response": clean_response,
                 # Remove any "Summary:" prefix if present
                 if clean_summary.startswith("Summary:"):
                     clean_summary = clean_summary[9:].strip()
+                # Log the complete narrative summary response
+                logger.info("=" * 80)
+                logger.info("NARRATIVE SUMMARY - COMPLETE LLM RESPONSE:")
+                logger.info("=" * 80)
+                logger.info(f"Agent: {self.agent_id}")
+                logger.info(f"Task: narrative_summary")
+                logger.info(f"Interactions Count: {len(interactions)}")
+                logger.info(f"Summary Length: {len(clean_summary)} characters")
+                logger.info("-" * 40)
+                logger.info("FULL NARRATIVE SUMMARY CONTENT:")
+                logger.info("-" * 40)
+                logger.info(clean_summary)
+                logger.info("-" * 40)
+                logger.info("END OF NARRATIVE SUMMARY RESPONSE")
+                logger.info("=" * 80)
                 return clean_summary
         except Exception as e:

src/llm_router.py CHANGED Viewed

@@ -85,6 +85,19 @@ class LLMRouter:
             logger.info(f"Calling HF Chat Completions API for model: {model_id}")
             logger.debug(f"Prompt length: {len(prompt)}")
             headers = {
                 "Authorization": f"Bearer {self.hf_token}",
@@ -108,11 +121,47 @@ class LLMRouter:
                 "top_p": kwargs.get("top_p", 0.95)
             }
             # Make the API call
             response = requests.post(api_url, json=payload, headers=headers, timeout=60)
             if response.status_code == 200:
                 result = response.json()
                 # Handle chat completions response format
                 if "choices" in result and len(result["choices"]) > 0:
                     message = result["choices"][0].get("message", {})
@@ -124,6 +173,19 @@ class LLMRouter:
                         return None
                     logger.info(f"HF API returned response (length: {len(generated_text)})")
                     return generated_text
                 else:
                     logger.error(f"Unexpected response format: {result}")

             logger.info(f"Calling HF Chat Completions API for model: {model_id}")
             logger.debug(f"Prompt length: {len(prompt)}")
+            logger.info("=" * 80)
+            logger.info("LLM API REQUEST - COMPLETE PROMPT:")
+            logger.info("=" * 80)
+            logger.info(f"Model: {model_id}")
+            logger.info(f"Task Type: {task_type}")
+            logger.info(f"Prompt Length: {len(prompt)} characters")
+            logger.info("-" * 40)
+            logger.info("FULL PROMPT CONTENT:")
+            logger.info("-" * 40)
+            logger.info(prompt)
+            logger.info("-" * 40)
+            logger.info("END OF PROMPT")
+            logger.info("=" * 80)
             headers = {
                 "Authorization": f"Bearer {self.hf_token}",
                 "top_p": kwargs.get("top_p", 0.95)
             }
+            # Log complete API request details
+            logger.info("=" * 80)
+            logger.info("LLM API REQUEST DETAILS:")
+            logger.info("=" * 80)
+            logger.info(f"API URL: {api_url}")
+            logger.info(f"Model: {model_id}")
+            logger.info(f"Task Type: {task_type}")
+            logger.info(f"Max Tokens: {kwargs.get('max_tokens', 2000)}")
+            logger.info(f"Temperature: {kwargs.get('temperature', 0.7)}")
+            logger.info(f"Top P: {kwargs.get('top_p', 0.95)}")
+            logger.info(f"User Message Length: {len(user_message)} characters")
+            logger.info("-" * 40)
+            logger.info("API PAYLOAD:")
+            logger.info("-" * 40)
+            import json
+            logger.info(json.dumps(payload, indent=2))
+            logger.info("-" * 40)
+            logger.info("END OF API REQUEST")
+            logger.info("=" * 80)
             # Make the API call
             response = requests.post(api_url, json=payload, headers=headers, timeout=60)
             if response.status_code == 200:
                 result = response.json()
+                # Log complete API response metadata
+                logger.info("=" * 80)
+                logger.info("LLM API RESPONSE METADATA:")
+                logger.info("=" * 80)
+                logger.info(f"Status Code: {response.status_code}")
+                logger.info(f"Response Headers: {dict(response.headers)}")
+                logger.info(f"Response Size: {len(response.text)} characters")
+                logger.info("-" * 40)
+                logger.info("COMPLETE API RESPONSE JSON:")
+                logger.info("-" * 40)
+                logger.info(json.dumps(result, indent=2))
+                logger.info("-" * 40)
+                logger.info("END OF API RESPONSE METADATA")
+                logger.info("=" * 80)
                 # Handle chat completions response format
                 if "choices" in result and len(result["choices"]) > 0:
                     message = result["choices"][0].get("message", {})
                         return None
                     logger.info(f"HF API returned response (length: {len(generated_text)})")
+                    logger.info("=" * 80)
+                    logger.info("COMPLETE LLM API RESPONSE:")
+                    logger.info("=" * 80)
+                    logger.info(f"Model: {model_id}")
+                    logger.info(f"Task Type: {task_type}")
+                    logger.info(f"Response Length: {len(generated_text)} characters")
+                    logger.info("-" * 40)
+                    logger.info("FULL RESPONSE CONTENT:")
+                    logger.info("-" * 40)
+                    logger.info(generated_text)
+                    logger.info("-" * 40)
+                    logger.info("END OF LLM RESPONSE")
+                    logger.info("=" * 80)
                     return generated_text
                 else:
                     logger.error(f"Unexpected response format: {result}")

test_logging.py ADDED Viewed

	@@ -0,0 +1,116 @@

+#!/usr/bin/env python3
+"""
+Test script to verify comprehensive LLM API logging
+This script tests the logging configuration to ensure all LLM responses
+are logged without truncation in container logs.
+"""
+import logging
+import asyncio
+import sys
+import os
+# Add the src directory to the path
+sys.path.insert(0, '.')
+sys.path.insert(0, 'src')
+# Configure logging to match the main application
+logging.basicConfig(
+    level=logging.INFO,
+    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s',
+    handlers=[
+        logging.StreamHandler(),
+        logging.FileHandler('test_logging.log')
+    ]
+)
+logger = logging.getLogger(__name__)
+async def test_llm_logging():
+    """Test the LLM logging functionality"""
+    logger.info("=" * 80)
+    logger.info("TESTING LLM API LOGGING CONFIGURATION")
+    logger.info("=" * 80)
+    try:
+        # Import the LLM router
+        from src.llm_router import LLMRouter
+        from src.config import settings
+        logger.info("✓ Successfully imported LLM router and config")
+        # Initialize LLM router with a test token
+        test_token = os.getenv('HF_TOKEN', 'test_token')
+        llm_router = LLMRouter(test_token)
+        logger.info("✓ LLM router initialized")
+        # Test a simple inference call
+        test_prompt = "Hello, this is a test prompt for logging verification."
+        test_task = "response_synthesis"
+        logger.info(f"Testing with prompt: {test_prompt}")
+        logger.info(f"Task type: {test_task}")
+        # This will trigger the comprehensive logging
+        result = await llm_router.route_inference(
+            task_type=test_task,
+            prompt=test_prompt,
+            max_tokens=100,
+            temperature=0.7
+        )
+        if result:
+            logger.info("✓ LLM call completed successfully")
+            logger.info(f"Result length: {len(result)} characters")
+        else:
+            logger.info("✓ LLM call completed (fallback used)")
+        logger.info("=" * 80)
+        logger.info("LOGGING TEST COMPLETED")
+        logger.info("=" * 80)
+        logger.info("Check the logs above for:")
+        logger.info("1. Complete API request details")
+        logger.info("2. Full prompt content")
+        logger.info("3. Complete API response metadata")
+        logger.info("4. Full LLM response content")
+        logger.info("5. All responses should be untruncated")
+        logger.info("=" * 80)
+    except ImportError as e:
+        logger.error(f"Import error: {e}")
+        logger.info("This is expected if dependencies are not available")
+    except Exception as e:
+        logger.error(f"Test error: {e}")
+        logger.info("This is expected if API calls fail")
+def test_logging_configuration():
+    """Test basic logging configuration"""
+    logger.info("Testing basic logging configuration...")
+    # Test different log levels
+    logger.debug("This is a debug message")
+    logger.info("This is an info message")
+    logger.warning("This is a warning message")
+    logger.error("This is an error message")
+    # Test long message logging
+    long_message = "This is a very long message that should not be truncated. " * 50
+    logger.info(f"Long message test (length: {len(long_message)}): {long_message}")
+    logger.info("✓ Basic logging configuration test completed")
+if __name__ == "__main__":
+    print("Testing LLM API Logging Configuration")
+    print("=" * 50)
+    # Test basic logging first
+    test_logging_configuration()
+    # Test LLM logging
+    asyncio.run(test_llm_logging())
+    print("\nTest completed. Check 'test_logging.log' file for detailed logs.")
+    print("In container environments, these logs will appear in container output.")