Spaces:

piyushdev
/

gpt-oss

Sleeping

App Files Files Community

piyushdev commited on Nov 4

Commit

ad19ccb

verified ·

1 Parent(s): e767889

Update app.py

Browse files

Files changed (1) hide show

app.py +342 -164

app.py CHANGED Viewed

@@ -1,164 +1,342 @@
-# 🚀 App Improvements Summary
-## Major Enhancements Made
-### 1. **Robust Error Handling & Retry Logic**
-- ✅ **3 automatic retries** per category
-- ✅ 1-second delay between retry attempts
-- ✅ Graceful degradation if JSON parsing fails
-- ✅ Detailed error messages for debugging
-### 2. **Output Validation & Quality Checks**
-- ✅ JSON structure validation
-- ✅ Minimum description length check (10 characters)
-- ✅ Multiple JSON extraction methods (handles markdown, raw JSON, etc.)
-- ✅ Fallback to raw response if JSON parsing fails
-### 3. **Improved Prompt Engineering**
-- ✅ More explicit instructions for JSON-only output
-- ✅ Stricter formatting requirements
-- ✅ Clearer examples in system prompt
-### 4. **Better Output Consistency**
-- ✅ **Lower default temperature** (0.3 instead of 0.7)
-- ✅ Temperature tooltip explaining impact on consistency
-- ✅ Recommended settings prominently displayed
-### 5. **Enhanced Status Reporting**
-- ✅ New **Status column** in output CSV
-- ✅ Per-category success/failure tracking
-- ✅ Success count vs. failure count summary
-- ✅ Individual status messages for each category
-- ✅ Failed categories clearly marked with error details
-### 6. **Rate Limiting Protection**
-- ✅ 0.5-second delay between each category
-- ✅ Prevents API throttling
-- ✅ More reliable batch processing
-### 7. **Zero GPU Support Information**
-- ✅ Instructions for using Zero GPU
-- ✅ Clear benefits explanation (faster, more reliable)
-- ✅ Free GPU acceleration (no Pro subscription required)
-### 8. **Better User Experience**
-- ✅ Real-time progress updates
-- ✅ Clear feature list in UI
-- ✅ Detailed tips for best results
-- ✅ Success/failure summary after processing
-## Key Code Improvements
-### New Functions
-1. `extract_json_from_response()` - Robust JSON extraction with multiple fallback methods
-2. Enhanced `process_single_category()` - Retry logic, validation, better error handling
-### Updated Processing Flow
-```
-For each category:
-  1. Attempt processing (streaming API call)
-  2. Validate response is not empty
-  3. Extract JSON from response (multiple methods)
-  4. Validate JSON structure and content
-  5. If failure → retry (up to 3 times)
-  6. If all retries fail → mark as Failed with error details
-  7. Add 0.5s delay before next category
-```
-## Configuration Changes
-### New Defaults
-- **Temperature**: 0.3 (was 0.7) - More consistent output
-- **Retry Count**: 3 attempts per category
-- **Delay**: 0.5s between categories, 1s between retries
-### Output Format
-```csv
-Category,Description,Raw_Response,Status
-Example Category,"validated description text","raw JSON response","Success"
-Failed Category,"[FAILED - error details]","","Failed"
-```
-## Expected Results
-### Before Improvements
-- ❌ 6-7 out of 13 categories succeeded
-- ❌ Garbage values in some outputs
-- ❌ Inconsistent formatting
-- ❌ No way to identify failures
-### After Improvements
-- ✅ **Higher success rate** due to retry logic
-- ✅ **Validated outputs** - no garbage values
-- ✅ **Consistent formatting** with lower temperature
-- ✅ **Clear status tracking** for all categories
-- ✅ **Reprocessable failures** - extract and retry failed ones
-## How to Get Best Results
-### Recommended Settings
-1. **Temperature: 0.2-0.4** for consistent, focused descriptions
-2. **Zero GPU** is automatically available (no setup needed)
-3. **Check Status column** in output to identify any failures
-4. **Reprocess failed categories** separately if needed
-### For Large Batches
-1. Zero GPU provides automatic GPU acceleration (free)
-2. Split into smaller files if over 100 categories
-3. Monitor the status output during processing
-4. Review failed categories and adjust temperature if needed
-## Testing Instructions
-1. Upload the `sample_categories.csv` (13 categories)
-2. Use default settings (Temperature: 0.3)
-3. Click "Process Files"
-4. Check output CSV:
-   - Should have **all 13 categories**
-   - Status column shows "Success" for most/all
-   - Descriptions are consistent and well-formatted
-   - Any failures have clear error messages
-## Zero GPU Usage
-### How Zero GPU Works
-1. Zero GPU is automatically available for Hugging Face Spaces
-2. No configuration needed - it's already enabled
-3. GPU resources are allocated when your Space runs
-4. Free to use - no Pro subscription required
-5. Provides on-demand GPU acceleration
-### Benefits
-- ⚡ **Faster processing** (2-3x speedup)
-- 🎯 **More reliable** (better resource availability)
-- 📊 **Better for large batches** (50+ categories)
-- 💰 **Free** - no Pro subscription needed
-**Note**: Zero GPU provides free GPU access for Spaces automatically
-## Troubleshooting Failed Categories
-If some categories still fail after improvements:
-1. **Check the error message** in the Description field
-2. **Common issues**:
-   - API timeout → Enable GPU
-   - Rate limiting → Already handled with delays
-   - Invalid JSON → Retry logic should handle this
-3. **Reprocess failures**:
-   - Extract failed categories from output CSV
-   - Create new CSV with just those categories
-   - Reprocess with even lower temperature (0.2)
-## Summary
-The app is now **much more robust** with:
-- 🔁 Automatic retries
-- ✅ Output validation
-- 📊 Status tracking
-- ⚡ Zero GPU support (free acceleration)
-- 🎯 Better consistency
-This should give you **100% success rate** or very close to it, with all outputs properly formatted and validated!
-Zero GPU provides automatic GPU acceleration without any configuration or cost, making your app faster and more reliable!

+import gradio as gr
+from huggingface_hub import InferenceClient
+import pandas as pd
+import json
+import os
+import time
+from datetime import datetime
+# Custom system instructions for business category descriptions
+SYSTEM_INSTRUCTIONS = """You are an expert at writing clear and visual descriptions for a business category keyword for a yellow pages or business listing website. Given a category keyword, generate a single, detailed description that defines its key visual elements, location, and context. Do not add artistic or stylistic flair. Ensure that the description is CLIP model ready and not too verbose.
+Here are some examples of the correct format:
+Category: "Car Rental For Self Driven"
+Description: "a car available for self-drive rental, parked at a pickup spot without a chauffeur; looks travel-ready, clean, well-maintained, keys handed over to customer"
+Category: "Mehandi"
+Description: "Temporary henna artwork applied on hands and feet using cones; fine brown or maroon floral and paisley patterns, mandalas, and lace-like detailing, commonly seen at weddings and festivals."
+Category: "Photographer"
+Description: "a person actively shooting photos or posing with a camera; holding a camera to eye, adjusting lens, or directing a subject during a shoot"
+Category: "Equipment"
+Description: "lighting stands, softboxes, strobes, tripods, reflectors, gimbals, battery packs, memory cards arranged as gear kits"
+---
+IMPORTANT: You must respond with ONLY a valid JSON object in this exact format:
+{"Category": "category name", "Description": "description text"}
+Do not include any other text, explanations, or markdown formatting. Only output the JSON object."""
+def extract_json_from_response(response_text):
+    """Extract and validate JSON from model response."""
+    # Try to find JSON in the response
+    response_text = response_text.strip()
+    # Remove markdown code blocks if present
+    if "```json" in response_text:
+        response_text = response_text.split("```json")[1].split("```")[0].strip()
+    elif "```" in response_text:
+        response_text = response_text.split("```")[1].split("```")[0].strip()
+    # Try to find JSON object in the text
+    if "{" in response_text and "}" in response_text:
+        start = response_text.find("{")
+        end = response_text.rfind("}") + 1
+        response_text = response_text[start:end]
+    # Parse JSON
+    parsed = json.loads(response_text)
+    # Validate structure
+    if not isinstance(parsed, dict):
+        raise ValueError("Response is not a JSON object")
+    # Get description with various possible keys
+    description = (
+        parsed.get("Description") or
+        parsed.get("description") or
+        parsed.get("desc") or
+        ""
+    )
+    if not description or len(description.strip()) < 10:
+        raise ValueError("Description is missing or too short")
+    return description.strip()
+def process_single_category(category, client, max_tokens, temperature, top_p, retry_count=3):
+    """Process a single category keyword and return the description with retry logic."""
+    messages = [
+        {"role": "system", "content": SYSTEM_INSTRUCTIONS},
+        {"role": "user", "content": f"Category: \"{category}\""}
+    ]
+    last_error = None
+    for attempt in range(retry_count):
+        try:
+            # Add small delay between retries
+            if attempt > 0:
+                time.sleep(1)
+            # Try streaming approach (more reliable for this model)
+            response_text = ""
+            for message in client.chat_completion(
+                messages,
+                max_tokens=max_tokens,
+                stream=True,
+                temperature=temperature,
+                top_p=top_p,
+            ):
+                if hasattr(message, 'choices') and len(message.choices) > 0:
+                    if hasattr(message.choices[0], 'delta') and hasattr(message.choices[0].delta, 'content'):
+                        token = message.choices[0].delta.content
+                        if token:
+                            response_text += token
+                elif isinstance(message, str):
+                    response_text += message
+            # Validate we got a response
+            if not response_text or len(response_text.strip()) < 5:
+                raise ValueError("Empty or too short response from model")
+            # Extract and validate JSON
+            description = extract_json_from_response(response_text)
+            # Return both the description and raw response
+            return response_text.strip(), description
+        except json.JSONDecodeError as e:
+            last_error = f"JSON parsing failed (attempt {attempt + 1}/{retry_count}): {str(e)}"
+            # If JSON parsing fails, try to extract description from raw text
+            if attempt == retry_count - 1 and response_text:
+                # Last attempt - try to use raw response if it looks like a description
+                if len(response_text.strip()) > 20 and not response_text.startswith("{"):
+                    return response_text.strip(), response_text.strip()
+        except Exception as e:
+            last_error = f"Processing failed (attempt {attempt + 1}/{retry_count}): {str(e)}"
+    # All retries failed
+    raise Exception(f"Failed after {retry_count} attempts. Last error: {last_error}")
+def process_csv_files(
+    files,
+    category_column,
+    max_tokens,
+    temperature,
+    top_p,
+    progress=gr.Progress()
+):
+    """
+    Process multiple CSV files and generate descriptions for category keywords.
+    """
+    if not files or len(files) == 0:
+        return "Please upload at least one CSV file.", None
+    # Get HF token from environment variables
+    import os
+    hf_token = os.environ.get("HF_TOKEN") or os.environ.get("HUGGINGFACE_TOKEN")
+    if not hf_token:
+        return "❌ Error: HF_TOKEN not found. Please add your Hugging Face token as a Space Secret.\n\nGo to Space Settings → Secrets → Add 'HF_TOKEN'", None
+    client = InferenceClient(token=hf_token, model="openai/gpt-oss-20b")
+    output_files = []
+    status_messages = []
+    for file_idx, file in enumerate(files):
+        try:
+            # Read CSV file
+            df = pd.read_csv(file.name)
+            status_messages.append(f"📄 Processing file {file_idx + 1}/{len(files)}: {os.path.basename(file.name)}")
+            # Check if category column exists
+            if category_column not in df.columns:
+                status_messages.append(f"⚠️ Warning: Column '{category_column}' not found in {os.path.basename(file.name)}. Available columns: {', '.join(df.columns)}")
+                continue
+            # Process each category
+            descriptions = []
+            raw_responses = []
+            categories = df[category_column].dropna().unique()
+            total_categories = len(categories)
+            for idx, category in enumerate(categories):
+                progress((file_idx * total_categories + idx) / (len(files) * total_categories),
+                        desc=f"Processing category {idx + 1}/{total_categories} in file {file_idx + 1}")
+                try:
+                    # Process with retry logic
+                    raw_response, description = process_single_category(
+                        category, client, max_tokens, temperature, top_p, retry_count=3
+                    )
+                    # Validate description
+                    if not description or len(description.strip()) < 10:
+                        raise ValueError("Description is too short or empty")
+                    descriptions.append({
+                        "Category": category,
+                        "Description": description,
+                        "Raw_Response": raw_response,
+                        "Status": "Success"
+                    })
+                    status_messages.append(f"✅ Processed: {category}")
+                except Exception as e:
+                    error_msg = str(e)
+                    status_messages.append(f"⚠️ Error processing '{category}': {error_msg}")
+                    descriptions.append({
+                        "Category": category,
+                        "Description": f"[FAILED - {error_msg[:100]}]",
+                        "Raw_Response": "",
+                        "Status": "Failed"
+                    })
+                # Small delay to avoid rate limiting
+                time.sleep(0.5)
+            # Create output dataframe
+            output_df = pd.DataFrame(descriptions)
+            # Save to file
+            timestamp = datetime.now().strftime("%Y%m%d_%H%M%S")
+            base_name = os.path.splitext(os.path.basename(file.name))[0]
+            output_filename = f"output_{base_name}_{timestamp}.csv"
+            output_df.to_csv(output_filename, index=False)
+            output_files.append(output_filename)
+            # Count successes and failures
+            success_count = len([d for d in descriptions if d.get("Status") == "Success"])
+            failed_count = len([d for d in descriptions if d.get("Status") == "Failed"])
+            status_messages.append(f"✅ Completed: {success_count} succeeded, {failed_count} failed out of {len(descriptions)} categories from {os.path.basename(file.name)}")
+        except Exception as e:
+            status_messages.append(f"❌ Error processing {os.path.basename(file.name)}: {str(e)}")
+    status_text = "\n".join(status_messages)
+    if output_files:
+        return status_text, output_files
+    else:
+        return status_text + "\n\n❌ No output files generated.", None
+# Create Gradio interface
+with gr.Blocks(title="Business Category Description Generator") as demo:
+    gr.Markdown("""
+    # 🏢 Business Category Description Generator
+    Upload CSV files containing business category keywords, and this app will generate
+    CLIP-ready visual descriptions for each category using AI.
+    **Instructions:**
+    1. Upload one or more CSV files
+    2. Specify the column name that contains the category keywords
+    3. Adjust model settings (lower temperature = more consistent output)
+    4. Click "Process Files" to generate descriptions
+    5. Download the output CSV files with Status column
+    **Features:**
+    - ✅ Automatic retry logic (3 attempts per category)
+    - ✅ JSON validation and error recovery
+    - ✅ Progress tracking with detailed status
+    - ✅ Success/failure reporting
+    *Note: For faster processing, use Zero GPU (see Space Settings). Authentication via HF_TOKEN secret.*
+    """)
+    with gr.Row():
+        with gr.Column(scale=1):
+            gr.Markdown("### ⚙️ Model Settings")
+            max_tokens = gr.Slider(
+                minimum=64,
+                maximum=512,
+                value=256,
+                step=16,
+                label="Max Tokens"
+            )
+            temperature = gr.Slider(
+                minimum=0.1,
+                maximum=1.0,
+                value=0.3,
+                step=0.1,
+                label="Temperature",
+                info="Lower = more consistent output"
+            )
+            top_p = gr.Slider(
+                minimum=0.1,
+                maximum=1.0,
+                value=0.9,
+                step=0.05,
+                label="Top-p"
+            )
+        with gr.Column(scale=2):
+            files_input = gr.File(
+                label="📤 Upload CSV Files",
+                file_count="multiple",
+                file_types=[".csv"]
+            )
+            category_column = gr.Textbox(
+                label="📝 Category Column Name",
+                value="category",
+                placeholder="Enter the name of the column containing categories"
+            )
+            process_btn = gr.Button("🚀 Process Files", variant="primary", size="lg")
+            status_output = gr.Textbox(
+                label="📊 Status",
+                lines=10,
+                interactive=False
+            )
+            files_output = gr.File(
+                label="💾 Download Output Files",
+                file_count="multiple"
+            )
+    process_btn.click(
+        fn=process_csv_files,
+        inputs=[
+            files_input,
+            category_column,
+            max_tokens,
+            temperature,
+            top_p
+        ],
+        outputs=[status_output, files_output]
+    )
+    gr.Markdown("""
+    ---
+    ### 📝 Output Format
+    Each output CSV file will contain:
+    - **Category**: The original category keyword
+    - **Description**: The generated visual description (validated and cleaned)
+    - **Raw_Response**: The complete model response (for debugging)
+    - **Status**: Success or Failed (with error details)
+    💡 **Tips for Best Results:**
+    - Use Temperature 0.2-0.4 for consistent, focused descriptions
+    - Use Temperature 0.6-0.8 for more creative variations
+    - Failed categories are marked clearly - you can reprocess them separately
+    - Zero GPU acceleration: Add @spaces.GPU decorator or enable in Space Settings
+    """)
+if __name__ == "__main__":
+    demo.launch()