Alina Lozovskaya committed
Commit 6261fdc · Parent(s): c51b4e7
Update readme and comments
README.md
CHANGED

````diff
@@ -1,15 +1,15 @@
-# Reachy Mini conversation
+# Reachy Mini conversation app
 
-Conversational
+Conversational app for the Reachy Mini robot combining OpenAI's realtime APIs, vision pipelines, and choreographed motion libraries.
 
 
 
 ## Architecture
 
-The
+The app follows a layered architecture connecting the user, AI services, and robot hardware:
 
 <p align="center">
-<img src="docs/assets/
+<img src="docs/assets/conversation_app_arch.svg" alt="Architecture Diagram" width="600"/>
 </p>
 
 ## Overview
@@ -96,7 +96,7 @@ Some wheels (e.g. PyTorch) are large and require compatible CUDA or CPU builds
 Activate your virtual environment, ensure the Reachy Mini robot (or simulator) is reachable, then launch:
 
 ```bash
-reachy-mini-conversation-
+reachy-mini-conversation-app
 ```
 
 By default, the app runs in console mode for direct audio interaction. Use the `--gradio` flag to launch a web UI served locally at http://127.0.0.1:7860/ (required when running in simulation mode). With a camera attached, vision is handled by the gpt-realtime model when the camera tool is used. For local vision processing, use the `--local-vision` flag to process frames periodically using the SmolVLM2 model. Additionally, you can enable face tracking via YOLO or MediaPipe pipelines depending on the extras you installed.
@@ -116,19 +116,19 @@ By default, the app runs in console mode for direct audio interaction. Use the `
 - Run on hardware with MediaPipe face tracking:
 
 ```bash
-reachy-mini-conversation-
+reachy-mini-conversation-app --head-tracker mediapipe
 ```
 
 - Run with local vision processing (requires `local_vision` extra):
 
 ```bash
-reachy-mini-conversation-
+reachy-mini-conversation-app --local-vision
 ```
 
 - Disable the camera pipeline (audio-only conversation):
 
 ```bash
-reachy-mini-conversation-
+reachy-mini-conversation-app --no-camera
 ```
 
 ## LLM tools exposed to the assistant
````
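The usage paragraph in this diff also references a `--gradio` flag with no companion example; a minimal sketch of that invocation, assuming the flag is used on its own as the paragraph implies:

```bash
# Launch the web UI at http://127.0.0.1:7860/ (required in simulation mode)
reachy-mini-conversation-app --gradio
```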
docs/assets/{conversation_demo_arch.svg → conversation_app_arch.svg}
RENAMED
File without changes
src/reachy_mini_conversation_app/config.py
CHANGED

```diff
@@ -26,7 +26,7 @@ logger.info("Configuration loaded from .env file")
 
 
 class Config:
-    """Configuration class for the conversation
+    """Configuration class for the conversation app."""
 
     # Required
     OPENAI_API_KEY = os.getenv("OPENAI_API_KEY")
```
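For context, the hunk header shows this class is populated from a `.env` file via `os.getenv`; a minimal `.env` sketch using the one key visible in the diff (the value is a placeholder, and any other required keys are not shown here):

```bash
# .env — read at startup ("Configuration loaded from .env file")
OPENAI_API_KEY=sk-your-key-here
```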
src/reachy_mini_conversation_app/main.py
CHANGED

```diff
@@ -1,4 +1,4 @@
-"""Entrypoint for the Reachy Mini conversation
+"""Entrypoint for the Reachy Mini conversation app."""
 
 import os
 import sys
@@ -28,11 +28,11 @@ def update_chatbot(chatbot: List[Dict[str, Any]], response: Dict[str, Any]) -> L
 
 
 def main() -> None:
-    """Entrypoint for the Reachy Mini conversation
+    """Entrypoint for the Reachy Mini conversation app."""
     args = parse_args()
 
     logger = setup_logger(args.debug)
-    logger.info("Starting Reachy Mini Conversation
+    logger.info("Starting Reachy Mini Conversation App")
 
     if args.no_camera and args.head_tracker is not None:
         logger.warning("Head tracking is not activated due to --no-camera.")
```
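The unchanged context lines at the end of this hunk encode a flag interaction worth noting: `--no-camera` wins over any `--head-tracker` choice. A hypothetical invocation that would trigger that code path:

```bash
# Logs "Head tracking is not activated due to --no-camera." and runs audio-only
reachy-mini-conversation-app --no-camera --head-tracker mediapipe
```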
src/reachy_mini_conversation_app/utils.py
CHANGED

```diff
@@ -9,7 +9,7 @@ from reachy_mini_conversation_app.camera_worker import CameraWorker
 
 def parse_args() -> argparse.Namespace:
     """Parse command line arguments."""
-    parser = argparse.ArgumentParser("Reachy Mini Conversation
+    parser = argparse.ArgumentParser("Reachy Mini Conversation App")
     parser.add_argument(
         "--head-tracker",
         choices=["yolo", "mediapipe", None],
```
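One subtlety in the context lines above: listing `None` in `choices` only matters for the default, since values parsed from the command line are always strings. A standalone sketch illustrating the behavior (the `default` and `help` values are assumptions, not taken from the file):

```python
import argparse

# Mirror of the parser setup visible in the diff
parser = argparse.ArgumentParser("Reachy Mini Conversation App")
parser.add_argument(
    "--head-tracker",
    choices=["yolo", "mediapipe", None],
    default=None,  # assumption: no tracker unless explicitly requested
    help="Face-tracking backend (hypothetical help text)",
)

print(parser.parse_args([]).head_tracker)                               # None
print(parser.parse_args(["--head-tracker", "mediapipe"]).head_tracker)  # "mediapipe"
```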