Spaces:
Running
Running
feat(core): default to auto model provider
Browse files- [docs] Update "Custom Model with Provider" heading to "Model Examples" (README.md:102)
- [docs] Add "Auto provider" example and modify "Specific provider" example (README.md:105-108)
- [docs] Add "Auto" option description under "Chat Providers" (README.md:141)
- [docs] Update "Chat Examples" for auto and specific providers (README.md:178-183)
- [feat] Change default "Model Name" value to "openai/gpt-oss-20b" in Chat Assistant (app.py:198-199)
- [feat] Update "Model Name" placeholder text for clarity (app.py:198-199)
- [docs] Add "auto" provider description to "Popular Providers" markdown (app.py:246)
- [docs] Update model examples under "Popular Providers" markdown (app.py:250-251)
README.md
CHANGED
|
@@ -102,8 +102,12 @@ The app requires:
|
|
| 102 |
3. Adjust parameters if needed (temperature, model, etc.)
|
| 103 |
4. Watch the AI respond with streaming text
|
| 104 |
|
| 105 |
-
####
|
| 106 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
| 107 |
Model Name: openai/gpt-oss-20b:fireworks-ai
|
| 108 |
System Message: You are a helpful coding assistant specializing in Python.
|
| 109 |
```
|
|
@@ -141,6 +145,7 @@ System Message: You are a helpful coding assistant specializing in Python.
|
|
| 141 |
## 🎯 Provider-Specific Features
|
| 142 |
|
| 143 |
### Chat Providers
|
|
|
|
| 144 |
- **Fireworks AI**: Fast and reliable inference service
|
| 145 |
- **Cerebras**: High-performance inference with low latency
|
| 146 |
- **Cohere**: Advanced language models with multilingual support
|
|
@@ -175,8 +180,15 @@ System Message: You are a helpful coding assistant specializing in Python.
|
|
| 175 |
|
| 176 |
#### Chat Examples
|
| 177 |
```
|
| 178 |
-
|
| 179 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 180 |
"Write a creative story about a time-traveling cat"
|
| 181 |
"What are the pros and cons of renewable energy?"
|
| 182 |
```
|
|
|
|
| 102 |
3. Adjust parameters if needed (temperature, model, etc.)
|
| 103 |
4. Watch the AI respond with streaming text
|
| 104 |
|
| 105 |
+
#### Model Examples
|
| 106 |
```
|
| 107 |
+
# Auto provider (default - let HF choose best)
|
| 108 |
+
Model Name: openai/gpt-oss-20b
|
| 109 |
+
|
| 110 |
+
# Specific provider
|
| 111 |
Model Name: openai/gpt-oss-20b:fireworks-ai
|
| 112 |
System Message: You are a helpful coding assistant specializing in Python.
|
| 113 |
```
|
|
|
|
| 145 |
## 🎯 Provider-Specific Features
|
| 146 |
|
| 147 |
### Chat Providers
|
| 148 |
+
- **Auto**: Let HuggingFace choose the best provider (default)
|
| 149 |
- **Fireworks AI**: Fast and reliable inference service
|
| 150 |
- **Cerebras**: High-performance inference with low latency
|
| 151 |
- **Cohere**: Advanced language models with multilingual support
|
|
|
|
| 180 |
|
| 181 |
#### Chat Examples
|
| 182 |
```
|
| 183 |
+
# Using auto provider (default)
|
| 184 |
+
Model: openai/gpt-oss-20b
|
| 185 |
+
Prompt: "Explain quantum computing in simple terms"
|
| 186 |
+
|
| 187 |
+
# Using specific provider
|
| 188 |
+
Model: openai/gpt-oss-20b:fireworks-ai
|
| 189 |
+
Prompt: "Help me debug this Python code: [paste code]"
|
| 190 |
+
|
| 191 |
+
# Other example prompts:
|
| 192 |
"Write a creative story about a time-traveling cat"
|
| 193 |
"What are the pros and cons of renewable energy?"
|
| 194 |
```
|
app.py
CHANGED
|
@@ -116,7 +116,11 @@ def generate_image(
|
|
| 116 |
|
| 117 |
try:
|
| 118 |
# Get token from HF-Inferoxy proxy server
|
|
|
|
| 119 |
token, token_id = get_proxy_token(api_key=proxy_api_key)
|
|
|
|
|
|
|
|
|
|
| 120 |
|
| 121 |
# Create client with specified provider
|
| 122 |
client = InferenceClient(
|
|
@@ -124,6 +128,8 @@ def generate_image(
|
|
| 124 |
api_key=token
|
| 125 |
)
|
| 126 |
|
|
|
|
|
|
|
| 127 |
# Prepare generation parameters
|
| 128 |
generation_params = {
|
| 129 |
"model": model_name,
|
|
@@ -140,9 +146,14 @@ def generate_image(
|
|
| 140 |
if seed != -1:
|
| 141 |
generation_params["seed"] = seed
|
| 142 |
|
|
|
|
|
|
|
|
|
|
| 143 |
# Generate image
|
| 144 |
image = client.text_to_image(**generation_params)
|
| 145 |
|
|
|
|
|
|
|
| 146 |
# Report successful token usage
|
| 147 |
report_token_status(token_id, "success", api_key=proxy_api_key)
|
| 148 |
|
|
@@ -188,64 +199,70 @@ with gr.Blocks(title="HF-Inferoxy AI Hub", theme=gr.themes.Soft()) as demo:
|
|
| 188 |
|
| 189 |
# ==================== CHAT TAB ====================
|
| 190 |
with gr.Tab("π¬ Chat Assistant", id="chat"):
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 191 |
with gr.Row():
|
| 192 |
-
with gr.Column(
|
| 193 |
-
# Create chat interface
|
| 194 |
-
chatbot = gr.ChatInterface(
|
| 195 |
-
chat_respond,
|
| 196 |
-
type="messages",
|
| 197 |
-
title="",
|
| 198 |
-
description="",
|
| 199 |
-
additional_inputs=[
|
| 200 |
-
gr.Textbox(
|
| 201 |
-
value="You are a helpful and friendly AI assistant. Provide clear, accurate, and helpful responses.",
|
| 202 |
-
label="System Message",
|
| 203 |
-
lines=2,
|
| 204 |
-
placeholder="Define the assistant's personality and behavior..."
|
| 205 |
-
),
|
| 206 |
-
gr.Textbox(
|
| 207 |
-
value="openai/gpt-oss-20b:fireworks-ai",
|
| 208 |
-
label="Model Name",
|
| 209 |
-
placeholder="e.g., openai/gpt-oss-20b:fireworks-ai or mistralai/Mistral-7B-Instruct-v0.2:groq"
|
| 210 |
-
),
|
| 211 |
-
gr.Slider(
|
| 212 |
-
minimum=1, maximum=4096, value=1024, step=1,
|
| 213 |
-
label="Max New Tokens"
|
| 214 |
-
),
|
| 215 |
-
gr.Slider(
|
| 216 |
-
minimum=0.1, maximum=2.0, value=0.7, step=0.1,
|
| 217 |
-
label="Temperature"
|
| 218 |
-
),
|
| 219 |
-
gr.Slider(
|
| 220 |
-
minimum=0.1, maximum=1.0, value=0.95, step=0.05,
|
| 221 |
-
label="Top-p (nucleus sampling)"
|
| 222 |
-
),
|
| 223 |
-
],
|
| 224 |
-
)
|
| 225 |
-
|
| 226 |
-
with gr.Column(scale=1):
|
| 227 |
gr.Markdown("""
|
| 228 |
### π‘ Chat Tips
|
| 229 |
|
| 230 |
**Model Format:**
|
| 231 |
-
- Single model: `openai/gpt-oss-20b`
|
| 232 |
-
- With provider: `
|
| 233 |
|
| 234 |
**Popular Models:**
|
| 235 |
- `openai/gpt-oss-20b` - Fast general purpose
|
| 236 |
- `meta-llama/Llama-2-7b-chat-hf` - Chat optimized
|
| 237 |
- `microsoft/DialoGPT-medium` - Conversation
|
| 238 |
- `google/flan-t5-base` - Instruction following
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 239 |
|
| 240 |
-
**
|
| 241 |
-
-
|
| 242 |
-
-
|
| 243 |
-
-
|
| 244 |
-
-
|
| 245 |
-
-
|
| 246 |
|
| 247 |
-
**
|
| 248 |
-
`openai/gpt-oss-20b
|
|
|
|
| 249 |
""")
|
| 250 |
|
| 251 |
# ==================== IMAGE GENERATION TAB ====================
|
|
|
|
| 116 |
|
| 117 |
try:
|
| 118 |
# Get token from HF-Inferoxy proxy server
|
| 119 |
+
print(f"π Image: Requesting token from proxy...")
|
| 120 |
token, token_id = get_proxy_token(api_key=proxy_api_key)
|
| 121 |
+
print(f"β
Image: Got token: {token_id}")
|
| 122 |
+
|
| 123 |
+
print(f"π¨ Image: Using model='{model_name}', provider='{provider}'")
|
| 124 |
|
| 125 |
# Create client with specified provider
|
| 126 |
client = InferenceClient(
|
|
|
|
| 128 |
api_key=token
|
| 129 |
)
|
| 130 |
|
| 131 |
+
print(f"π Image: Client created, preparing generation params...")
|
| 132 |
+
|
| 133 |
# Prepare generation parameters
|
| 134 |
generation_params = {
|
| 135 |
"model": model_name,
|
|
|
|
| 146 |
if seed != -1:
|
| 147 |
generation_params["seed"] = seed
|
| 148 |
|
| 149 |
+
print(f"π Image: Dimensions: {width}x{height}, steps: {num_inference_steps}, guidance: {guidance_scale}")
|
| 150 |
+
print(f"π‘ Image: Making generation request...")
|
| 151 |
+
|
| 152 |
# Generate image
|
| 153 |
image = client.text_to_image(**generation_params)
|
| 154 |
|
| 155 |
+
print(f"πΌοΈ Image: Generation completed! Image type: {type(image)}")
|
| 156 |
+
|
| 157 |
# Report successful token usage
|
| 158 |
report_token_status(token_id, "success", api_key=proxy_api_key)
|
| 159 |
|
|
|
|
| 199 |
|
| 200 |
# ==================== CHAT TAB ====================
|
| 201 |
with gr.Tab("π¬ Chat Assistant", id="chat"):
|
| 202 |
+
# Main chat interface - full width and prominent
|
| 203 |
+
chatbot = gr.ChatInterface(
|
| 204 |
+
chat_respond,
|
| 205 |
+
type="messages",
|
| 206 |
+
title="",
|
| 207 |
+
description="",
|
| 208 |
+
additional_inputs=[
|
| 209 |
+
gr.Textbox(
|
| 210 |
+
value="openai/gpt-oss-20b",
|
| 211 |
+
label="Model Name",
|
| 212 |
+
placeholder="e.g., openai/gpt-oss-20b or openai/gpt-oss-20b:fireworks-ai"
|
| 213 |
+
),
|
| 214 |
+
gr.Textbox(
|
| 215 |
+
value="You are a helpful and friendly AI assistant. Provide clear, accurate, and helpful responses.",
|
| 216 |
+
label="System Message",
|
| 217 |
+
lines=2,
|
| 218 |
+
placeholder="Define the assistant's personality and behavior..."
|
| 219 |
+
),
|
| 220 |
+
gr.Slider(
|
| 221 |
+
minimum=1, maximum=4096, value=1024, step=1,
|
| 222 |
+
label="Max New Tokens"
|
| 223 |
+
),
|
| 224 |
+
gr.Slider(
|
| 225 |
+
minimum=0.1, maximum=2.0, value=0.7, step=0.1,
|
| 226 |
+
label="Temperature"
|
| 227 |
+
),
|
| 228 |
+
gr.Slider(
|
| 229 |
+
minimum=0.1, maximum=1.0, value=0.95, step=0.05,
|
| 230 |
+
label="Top-p (nucleus sampling)"
|
| 231 |
+
),
|
| 232 |
+
],
|
| 233 |
+
)
|
| 234 |
+
|
| 235 |
+
# Configuration tips below the chat
|
| 236 |
with gr.Row():
|
| 237 |
+
with gr.Column():
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 238 |
gr.Markdown("""
|
| 239 |
### π‘ Chat Tips
|
| 240 |
|
| 241 |
**Model Format:**
|
| 242 |
+
- Single model: `openai/gpt-oss-20b` (uses auto provider)
|
| 243 |
+
- With provider: `openai/gpt-oss-20b:fireworks-ai`
|
| 244 |
|
| 245 |
**Popular Models:**
|
| 246 |
- `openai/gpt-oss-20b` - Fast general purpose
|
| 247 |
- `meta-llama/Llama-2-7b-chat-hf` - Chat optimized
|
| 248 |
- `microsoft/DialoGPT-medium` - Conversation
|
| 249 |
- `google/flan-t5-base` - Instruction following
|
| 250 |
+
""")
|
| 251 |
+
|
| 252 |
+
with gr.Column():
|
| 253 |
+
gr.Markdown("""
|
| 254 |
+
### π Popular Providers
|
| 255 |
|
| 256 |
+
- **auto** - Let HF choose best provider (default)
|
| 257 |
+
- **fireworks-ai** - Fast and reliable
|
| 258 |
+
- **cerebras** - High performance
|
| 259 |
+
- **groq** - Ultra-fast inference
|
| 260 |
+
- **together** - Wide model support
|
| 261 |
+
- **cohere** - Advanced language models
|
| 262 |
|
| 263 |
+
**Examples:**
|
| 264 |
+
- `openai/gpt-oss-20b` (auto provider)
|
| 265 |
+
- `openai/gpt-oss-20b:fireworks-ai` (specific provider)
|
| 266 |
""")
|
| 267 |
|
| 268 |
# ==================== IMAGE GENERATION TAB ====================
|