πŸ€– Qwen2-VL 2B Furniture Analysis LoRA

A fine-tuned adaptation of Qwen2-VL 2B using Low-Rank Adaptation (LoRA) for structured furniture attribute extraction and detailed captioning.

Model Description

Base Model: Qwen2-VL 2B
Fine-tuning Method: LoRA (Low-Rank Adaptation)
Training Dataset: Synthetic Furniture Dataset (10,000 images)
Specialization: Furniture detection, classification, and structured attribute extraction

🎯 Capabilities

The model excels at analyzing furniture images and extracting structured information including:

  • πŸͺ‘ Furniture Type Classification: Accurate identification of beds, tables, sofas, and chairs
  • 🎨 Style Recognition: Design style categorization (modern, traditional, minimalist, etc.)
  • 🌈 Color Analysis: Predominant color identification and description
  • πŸ”¨ Material Detection: Recognition of wood, metal, fabric, leather, and composite materials
  • πŸ“ Shape Characterization: Physical form and geometric properties
  • ✨ Detail Extraction: Decorative elements, hardware, and functional features
  • 🏠 Room Context: Appropriate room placement recommendations
  • πŸ’° Price Estimation: Relative price range categorization

βš™οΈ Technical Specifications

  • Architecture: Vision-Language Transformer with LoRA adapters
  • Input Resolution: 448x448 pixels (optimized for dataset)
  • Output Format: Structured JSON with predefined attribute schema
  • Memory Footprint: Significantly reduced compared to full fine-tuning
  • Inference Speed: Optimized for real-time furniture analysis

πŸ“Š Training Details

  • Training Images: 9,000 synthetic furniture images
  • Validation: 1,000 real furniture photographs
  • Image Generation: Stable Diffusion Medium 3.5
  • Test Set Annotation: Qwen2 VL 72B
  • Categories: Bed, table, sofa, chair

⚠️ Limitations

  • Category Scope: Limited to four main furniture categories (bed, table, sofa, chair)
  • Synthetic Training Bias: Potential domain gap between synthetic training data and real-world furniture
  • Language Support: Optimized for English descriptions
  • Image Quality: Best performance on well-lit, clear furniture images

πŸ“¦ Model Artifacts

  • LoRA Adapters: Lightweight adaptation weights for efficient deployment
  • Configuration Files: Training hyperparameters and model settings
  • Evaluation Metrics: Performance benchmarks on test dataset
  • Example Outputs: Sample structured responses for reference

This LoRA adaptation enables efficient furniture analysis while maintaining the general capabilities of the base Qwen2-VL model.

Downloads last month
5
Safetensors
Model size
2B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for filnow/furniture-caption-qwen-lora

Base model

Qwen/Qwen2-VL-2B
Finetuned
(315)
this model

Dataset used to train filnow/furniture-caption-qwen-lora