Spaces:

MCP-1st-Birthday
/

wrdler

Running

Surn commited on 9 days ago

Commit

d57c213

1 Parent(s): 850b1df

Enhance AI word generation and update to v0.1.1

Updated Wrdler to version 0.1.1 with significant enhancements to AI word generation, including intelligent word saving, retry mechanisms, and a 1000-word file size limit. Introduced dual generation modes using Hugging Face Space API and local transformers. Improved logging for better pipeline visibility. Updated documentation and added a new test for word file validation. Added new AI-generated word lists and a cooking-themed word list.

Files changed (12) hide show

CLAUDE.md +37 -14
README.md +32 -1
env.template +1 -0
specs/requirements.md +82 -11
specs/specs.md +37 -16
tests/test_word_file_validation.py +67 -0
wrdler/__init__.py +1 -1
wrdler/modules/__init__.py +5 -0
wrdler/word_loader_ai.py +245 -36
wrdler/words/cooking.txt +180 -0
wrdler/words/english.txt +24 -3
wrdler/words/wordlist.txt +27 -6

CLAUDE.md CHANGED Viewed

@@ -7,13 +7,22 @@ Wrdler is a simplified vocabulary puzzle game based on BattleWords, with these k
 - **No scope/radar visualization**
 - **2 free letter guesses at game start** (all instances of chosen letters are revealed)
-**Current Version:** 0.1.0
 **Repository:** https://github.com/Oncorporation/Wrdler.git
 **Live Demo:** [DEPLOYMENT_URL_HERE]
 ## Recent Changes
-**v0.1.0 (Current):**
 - ✅ Version updated to 0.1.0 across all files
 - ✅ AI word generation functionality added
 - ✅ Word list management enhanced with AI support
@@ -157,12 +166,21 @@ wrdler/
   - No gameplay logic changes required
   - Works offline for basic functionality
-### ✅ AI Word Generation (v0.1.0)
 - **AI-Powered Word Lists:** Generate custom word lists using Hugging Face Spaces or local transformers
 - **Topic-Based Generation:** Create words related to specific themes (e.g., "Ocean Life", "Space")
-- **Automatic Expansion:** New AI-generated words are saved to local files for future use
 - **Fallback Support:** Gracefully falls back to dictionary words if AI is unavailable
 - **Word Distribution:** Ensures exactly 25 words each of lengths 4, 5, and 6 per topic
 ### PLANNED: Local Player Storage (v0.3.0)
 - **Local Storage:**
@@ -210,7 +228,7 @@ wrdler/
 ### Development Status
-**Current Version:** 0.1.0 (Complete)
 - ✅ All 7 sprints complete
 - ✅ 100% test coverage (25/25 tests)
 - ✅ AI word generation implemented
@@ -347,6 +365,15 @@ The dataset repository will contain:
 ## Post-v0.0.2 Enhancements
 ### v0.1.0 (AI Word Generation)
 - AI-powered word list generation using Hugging Face Spaces
 - Topic-based word creation with automatic saving
@@ -496,19 +523,15 @@ From `pyproject.toml`:
 ## Version History Summary
-- **v0.1.0** (Current) - AI word generation, utility modules, version bump
-- **v0.0.4** (Previous) - Documentation sync, version update
 - **v0.0.2-0.0.3** - All 7 sprints complete, core Wrdler features
 - **v0.2.20-0.2.29** - Challenge Mode, PWA, remote storage (inherited from BattleWords)
 - **v0.1.x** - Initial BattleWords releases before Wrdler fork
-See README.md for complete changelog.
 ---
 **Last Updated:** 2025-01-31
-**Current Version:** 0.1.0
-**Status:** Production Ready - All Features Complete ✅
-## Test File Location
-All test files must be placed in the `/tests` folder. This ensures a clean project structure and makes it easy to discover and run all tests.

 - **No scope/radar visualization**
 - **2 free letter guesses at game start** (all instances of chosen letters are revealed)
+**Current Version:** 0.1.1
 **Repository:** https://github.com/Oncorporation/Wrdler.git
 **Live Demo:** [DEPLOYMENT_URL_HERE]
 ## Recent Changes
+**v0.1.1 (Current):**
+- ✅ Enhanced AI word generation logic with intelligent word saving
+- ✅ Automatic retry mechanism for insufficient word counts (up to 3 retries)
+- ✅1000-word file size limit to prevent dictionary bloat
+- ✅ Better new word detection (separates existing vs. new words before saving)
+- ✅ Improved HF Space API integration with graceful fallback to local models
+- ✅ Additional word generation when initial pass doesn't meet MIN_REQUIRED threshold
+- ✅ Enhanced logging for word generation pipeline visibility
+**v0.1.0 (Previous):**
 - ✅ Version updated to 0.1.0 across all files
 - ✅ AI word generation functionality added
 - ✅ Word list management enhanced with AI support
   - No gameplay logic changes required
   - Works offline for basic functionality
+### ✅ AI Word Generation (v0.1.0+)
 - **AI-Powered Word Lists:** Generate custom word lists using Hugging Face Spaces or local transformers
 - **Topic-Based Generation:** Create words related to specific themes (e.g., "Ocean Life", "Space")
+- **Automatic Word Expansion:** New AI-generated words are saved to local files for future use
+  - Intelligent word detection: separates existing dictionary words from new AI-generated words
+  - Only new words are saved to prevent duplicates
+  - Automatic retry mechanism (up to 3 attempts) if insufficient words generated
+  - 1000-word file size limit prevents dictionary bloat
+  - Files auto-sorted by length then alphabetically
 - **Fallback Support:** Gracefully falls back to dictionary words if AI is unavailable
 - **Word Distribution:** Ensures exactly 25 words each of lengths 4, 5, and 6 per topic
+- **Dual Generation Modes:**
+  - **HF Space API** (primary): Uses Hugging Face Space for word generation when `USE_HF_WORDS=true`
+  - **Local Models** (fallback): Falls back to local transformers models if HF Space unavailable
+- **Enhanced Logging:** Detailed pipeline visibility for debugging and monitoring
 ### PLANNED: Local Player Storage (v0.3.0)
 - **Local Storage:**
 ### Development Status
+**Current Version:** 0.1.1 (Complete)
 - ✅ All 7 sprints complete
 - ✅ 100% test coverage (25/25 tests)
 - ✅ AI word generation implemented
 ## Post-v0.0.2 Enhancements
+### v0.1.1 (AI Word Generation Enhancement)
+- Enhanced AI word generation with intelligent word saving
+- Automatic retry mechanism for insufficient word counts (up to 3 retries)
+- 1000-word file size limit to prevent dictionary bloat
+- Improved new word detection (separates existing vs. new words)
+- Better HF Space API integration with fallback to local models
+- Additional word generation when MIN_REQUIRED threshold not met
+- Enhanced logging for generation pipeline visibility
 ### v0.1.0 (AI Word Generation)
 - AI-powered word list generation using Hugging Face Spaces
 - Topic-based word creation with automatic saving
 ## Version History Summary
+- **v0.1.1** (Current) - Enhanced AI word generation with intelligent saving, retry logic, file size limits
+- **v0.1.0** (Previous) - AI word generation, utility modules, version bump
+- **v0.0.4** - Documentation sync, version update
 - **v0.0.2-0.0.3** - All 7 sprints complete, core Wrdler features
 - **v0.2.20-0.2.29** - Challenge Mode, PWA, remote storage (inherited from BattleWords)
 - **v0.1.x** - Initial BattleWords releases before Wrdler fork
 ---
 **Last Updated:** 2025-01-31
+**Current Version:** 0.1.1
+**Status:** Production Ready - AI Enhanced ✅

README.md CHANGED Viewed

@@ -13,7 +13,8 @@ tags:
 - vocabulary
 - streamlit
 - education
-short_description: Fast paced word guessing game
 thumbnail: >-
   https://cdn-uploads.huggingface.co/production/uploads/6346595c9e5f0fe83fc60444/6rWS4AIaozoNMCbx9F5Rv.png
 ---
@@ -50,6 +51,20 @@ Wrdler is a vocabulary learning game with a simplified grid and strategic letter
 - Sound effects for hits, misses, correct/incorrect guesses
 - Responsive UI built with Streamlit
 ### Customization
 - Multiple word lists (classic, fourth_grade, wordlist)
 - Wordlist sidebar controls (picker + one-click sort)
@@ -181,6 +196,22 @@ All test files must be placed in the `/tests` folder. This ensures a clean proje
 ## Changelog
 ### v0.0.8
  - remove background animation
  - add "easy" mode (single guess per reveal)

 - vocabulary
 - streamlit
 - education
+- ai
+short_description: Fast paced word guessing game with AI-generated word lists
 thumbnail: >-
   https://cdn-uploads.huggingface.co/production/uploads/6346595c9e5f0fe83fc60444/6rWS4AIaozoNMCbx9F5Rv.png
 ---
 - Sound effects for hits, misses, correct/incorrect guesses
 - Responsive UI built with Streamlit
+### AI Word Generation
+- **Topic-based word lists**: Generate custom word lists using AI for any theme
+- **Intelligent word expansion**: New AI-generated words automatically saved to local files
+  - Smart detection separates existing dictionary words from new AI words
+  - Only saves new words to prevent duplicates
+  - Automatic retry mechanism (up to 3 attempts) for insufficient word counts
+  - 1000-word file size limit prevents bloat
+  - Auto-sorted by length then alphabetically
+- **Dual generation modes**:
+  - **HF Space API** (primary): Uses Hugging Face Space when `USE_HF_WORDS=true`
+  - **Local transformers** (fallback): Falls back to local models if HF unavailable
+- **Fallback support**: Gracefully uses dictionary words if AI generation fails
+- **Guaranteed distribution**: Ensures exactly 25 words each of lengths 4, 5, and 6
 ### Customization
 - Multiple word lists (classic, fourth_grade, wordlist)
 - Wordlist sidebar controls (picker + one-click sort)
 ## Changelog
+### v0.1.1 (Current)
+- ✅ Enhanced AI word generation with intelligent word saving
+- ✅ Automatic retry mechanism for insufficient word counts (up to 3 retries)
+- ✅ 1000-word file size limit to prevent dictionary bloat
+- ✅ Improved new word detection (separates existing vs. new words before saving)
+- ✅ Better HF Space API integration with graceful fallback to local models
+- ✅ Additional word generation when initial pass doesn't meet MIN_REQUIRED threshold
+- ✅ Enhanced logging for word generation pipeline visibility
+### v0.1.0
+- ✅ AI word generation functionality added
+- ✅ Topic-based custom word list creation
+- ✅ Dual generation modes (HF Space API + local transformers)
+- ✅ Utility modules integration (storage, file_utils, constants)
+- ✅ Documentation synchronized across all files
 ### v0.0.8
  - remove background animation
  - add "easy" mode (single guess per reveal)

env.template CHANGED Viewed

@@ -15,6 +15,7 @@ TMPDIR=/tmp
 # Flash attention setting (optional)
 # USE_FLASH_ATTENTION=1
 CRYPTO_PK=btc_public_key_here
 IS_LOCAL=true

 # Flash attention setting (optional)
 # USE_FLASH_ATTENTION=1
+TF_ENABLE_ONEDNN_OPTS=0
 CRYPTO_PK=btc_public_key_here
 IS_LOCAL=true

specs/requirements.md CHANGED Viewed

@@ -1,11 +1,11 @@
 # Wrdler: Implementation Requirements
-**Version:** 0.1.0
-**Status:** All Features Complete - Ready for Deployment
 **Last Updated:** 2025-01-31
 This document breaks down the implementation tasks for Wrdler using the game rules described in `specs.md`. Wrdler is based on BattleWords but with a simplified 8x6 grid, horizontal-only words, and free letter guesses at the start.
-**Current Status:** ✅ All Phase 1 requirements complete, 100% tested (25/25 tests passing)
 ## Key Differences from BattleWords
 - 8x6 grid instead of 12x12
@@ -14,14 +14,15 @@ This document breaks down the implementation tasks for Wrdler using the game rul
 - No radar/scope visualization
 - 2 free letter guesses at game start
-## Implementation Details (v0.0.2)
-- **Tech Stack:** Python 3.12.8, Streamlit 1.51.0, numpy, matplotlib
 - **Architecture:** Single-player, local state in Streamlit session state
 - **Grid:** 8 columns × 6 rows (48 cells) with exactly six words
 - **Word Placement:** Horizontal-only, one word per row, no overlaps
 - **Entry Point:** `app.py`
 - **Testing:** pytest with 25/25 tests passing (100%)
-- **Development Time:** ~12.75 hours across 7 sprints
 ## Streamlit Components (Implemented in v0.0.2)
 - State & caching ✅
@@ -88,15 +89,27 @@ This document breaks down the implementation tasks for Wrdler using the game rul
 **Acceptance:** ✅ Types implemented and fully integrated (13/13 tests passing)
-### 2) Word List Management ✅ (Sprint 1)
 - ✅ English word list filtered to alphabetic uppercase, lengths in {4,5,6}
 - ✅ Loader centralized in `word_loader.py` with caching
 - ✅ Three word lists: classic, fourth_grade, wordlist
-- ✅ AI word generation support via `word_loader_ai.py` (generates 75 words per topic)
 - ✅ Unified loader (`load_word_list_or_ai`) routes between file-based and AI-generated words
 - ✅ Saves new AI-generated words to local files for expansion
-**Acceptance:** ✅ Loading function returns lists by length with >= 25 words per length
 ### 3) Puzzle Generation (8x6 Horizontal) ✅ (Sprint 2)
 - ✅ Randomly place 6 words on 8x6 grid, one per row
@@ -194,6 +207,12 @@ This document breaks down the implementation tasks for Wrdler using the game rul
 - 📋 Player statistics
 - 📋 Enhanced UI animations
 ### v1.0.0 (Long Term)
 - 📋 Multiple difficulty levels
 - 📋 Daily puzzle mode
@@ -214,8 +233,60 @@ This document breaks down the implementation tasks for Wrdler using the game rul
 ---
 **Last Updated:** 2025-01-31
-**Version:** 0.1.0
-**Status:** All Features Complete - Ready for Deployment 🚀
 ## Test File Location
 All test files must be placed in the `/tests` folder. This ensures a clean project structure and makes it easy to discover and run all tests.

 # Wrdler: Implementation Requirements
+**Version:** 0.1.1
+**Status:** Production Ready - AI Enhanced
 **Last Updated:** 2025-01-31
 This document breaks down the implementation tasks for Wrdler using the game rules described in `specs.md`. Wrdler is based on BattleWords but with a simplified 8x6 grid, horizontal-only words, and free letter guesses at the start.
+**Current Status:** ✅ All Phase 1 requirements complete, 100% tested (25/25 tests passing), AI word generation enhanced in v0.1.1
 ## Key Differences from BattleWords
 - 8x6 grid instead of 12x12
 - No radar/scope visualization
 - 2 free letter guesses at game start
+## Implementation Details (v0.1.1)
+- **Tech Stack:** Python 3.12.8, Streamlit 1.51.0, numpy, matplotlib, transformers, gradio_client
 - **Architecture:** Single-player, local state in Streamlit session state
 - **Grid:** 8 columns × 6 rows (48 cells) with exactly six words
 - **Word Placement:** Horizontal-only, one word per row, no overlaps
+- **AI Generation:** Topic-based word lists with intelligent saving and retry logic
 - **Entry Point:** `app.py`
 - **Testing:** pytest with 25/25 tests passing (100%)
+- **Development Time:** ~12.75 hours across 7 sprints (Phase 1) + AI enhancements
 ## Streamlit Components (Implemented in v0.0.2)
 - State & caching ✅
 **Acceptance:** ✅ Types implemented and fully integrated (13/13 tests passing)
+### 2) Word List Management ✅ (Sprint 1, Enhanced in v0.1.0-0.1.1)
 - ✅ English word list filtered to alphabetic uppercase, lengths in {4,5,6}
 - ✅ Loader centralized in `word_loader.py` with caching
 - ✅ Three word lists: classic, fourth_grade, wordlist
+- ✅ **AI word generation** support via `word_loader_ai.py`:
+  - Generates 75 words per topic (25 each of lengths 4, 5, 6)
+  - **Dual generation modes** (v0.1.0+):
+    - HF Space API (primary): Uses Hugging Face Space when `USE_HF_WORDS=true`
+    - Local transformers (fallback): Falls back to local models if HF unavailable
+  - **Intelligent word saving** (v0.1.1):
+    - Smart detection separates existing dictionary words from new AI-generated words
+    - Only saves new words to prevent duplicates
+    - Automatic retry mechanism (up to 3 attempts) if insufficient words generated
+    - 1000-word file size limit prevents dictionary bloat
+    - Auto-sorted by length then alphabetically
+  - **Additional word generation**: Automatically generates more words when MIN_REQUIRED threshold not met
+  - **Enhanced logging**: Detailed pipeline visibility for debugging
 - ✅ Unified loader (`load_word_list_or_ai`) routes between file-based and AI-generated words
 - ✅ Saves new AI-generated words to local files for expansion
+**Acceptance:** ✅ Loading function returns lists by length with >= 25 words per length; AI generation produces valid words with intelligent saving and retry logic
 ### 3) Puzzle Generation (8x6 Horizontal) ✅ (Sprint 2)
 - ✅ Randomly place 6 words on 8x6 grid, one per row
 - 📋 Player statistics
 - 📋 Enhanced UI animations
+### v0.4.0 (AI Expansion)
+- 📋 AI difficulty tuning based on player performance
+- 📋 Custom topic suggestions
+- 📋 Multi-language word generation
+- 📋 Word difficulty analysis and visualization
 ### v1.0.0 (Long Term)
 - 📋 Multiple difficulty levels
 - 📋 Daily puzzle mode
 ---
 **Last Updated:** 2025-01-31
+**Version:** 0.1.1
+**Status:** Production Ready - AI Enhanced 🚀
+## AI Word Generation Pipeline (v0.1.1)
+### Architecture
+```
+User Input (Topic)
+    ↓
+Check USE_HF_WORDS flag
+    ↓
+┌─────────────────────────────────────┐
+│ HF Space API (Primary)              │
+│ - gradio_client integration         │
+│ - Temperature: 0.95                 │
+│ - Max tokens: 512                   │
+└─────────────────────────────────────┘
+    ↓ (if fails or USE_HF_WORDS=false)
+┌──��──────────────────────────────────┐
+│ Local Transformers (Fallback)       │
+│ - Auto model selection              │
+│ - Device auto-detection             │
+│ - Temperature: 0.7                  │
+└─────────────────────────────────────┘
+    ↓
+Parse & Filter Words
+    ↓
+Identify New vs Existing
+    ↓
+Check MIN_REQUIRED threshold
+    ↓ (if insufficient)
+Generate Additional Words (up to 3 retries)
+    ↓
+Save New Words to File
+    ↓
+Validate & Sort File
+    ↓
+Return 75 Words for Game
+```
+### Word Saving Strategy
+1. **Detection Phase**: Separate new AI words from existing dictionary words
+2. **Validation Phase**: Check if file meets MIN_REQUIRED (25 words per length)
+3. **Retry Phase**: If insufficient, generate additional words (up to 3 attempts)
+4. **Save Phase**: Write only new words to topic-based file
+5. **Sort Phase**: Auto-sort by length then alphabetically
+6. **Limit Phase**: Stop adding if file reaches 1000 words
+### Error Handling
+- **HF Space API failure**: Graceful fallback to local model
+- **Model loading failure**: Try multiple models in priority order
+- **Device compatibility**: Retry pipeline without device parameter on error 422
+- **Insufficient words**: Automatic retry with targeted prompts
+- **File operations**: Detailed logging and error recovery
 ## Test File Location
 All test files must be placed in the `/tests` folder. This ensures a clean project structure and makes it easy to discover and run all tests.

specs/specs.md CHANGED Viewed

@@ -1,6 +1,6 @@
 # Wrdler Game Specifications (specs.md)
-**Version:** 0.1.0
-**Status:** All Features Complete - Ready for Deployment
 **Last Updated:** 2025-01-31
 ## Overview
@@ -58,7 +58,22 @@ Wrdler is a simplified vocabulary puzzle game based on BattleWords, but with key
 - ✅ 10 incorrect guess limit per game
 - ✅ Two game modes: Classic (chain guesses) and Too Easy (single guess per reveal)
-## Implemented Features (v0.0.2)
 ### Challenge Mode
 - ✅ **Game ID Sharing:** Each puzzle generates a shareable link with `?game_id=<sid>` to challenge others with the same word list
@@ -175,19 +190,25 @@ HF_REPO_ID/
 ## Development Status
-### Completed (v0.0.2) ✅
-All 7 sprints complete, 100% test coverage (25/25 tests passing):
-- **Sprint 1:** Core data models (rectangular grid support)
-- **Sprint 2:** Puzzle generator (horizontal-only, one-per-row)
-- **Sprint 3:** Radar visualization removal
-- **Sprint 4:** Free letter selection UI
-- **Sprint 5:** Grid UI updates for 8×6 display
-- **Sprint 6:** Integration testing
-- **Sprint 7:** Documentation finalization
-**Development Time:** ~12.75 hours
-**Test Pass Rate:** 100% (25/25 tests)
-**Status:** Ready for deployment! 🚀
 ### Future Roadmap
 - **v0.3.0:** Local persistent storage, high score tracking, player statistics

 # Wrdler Game Specifications (specs.md)
+**Version:** 0.1.1
+**Status:** Production Ready - AI Enhanced
 **Last Updated:** 2025-01-31
 ## Overview
 - ✅ 10 incorrect guess limit per game
 - ✅ Two game modes: Classic (chain guesses) and Too Easy (single guess per reveal)
+## Implemented Features (v0.1.1)
+### AI Word Generation (v0.1.0+)
+- ✅ **Topic-Based Generation:** Create custom word lists for any theme using AI
+- ✅ **Dual Generation Modes:**
+  - HF Space API (primary): Uses Hugging Face Space when `USE_HF_WORDS=true`
+  - Local transformers (fallback): Falls back to local models if HF unavailable
+- ✅ **Intelligent Word Management:**
+  - Smart detection separates existing dictionary words from new AI-generated words
+  - Only saves new words to prevent duplicates in word files
+  - Automatic retry mechanism (up to 3 attempts) if insufficient words generated
+  - 1000-word file size limit prevents dictionary bloat
+  - Auto-sorted by length then alphabetically
+- ✅ **Guaranteed Distribution:** Ensures exactly 25 words each of lengths 4, 5, and 6
+- ✅ **Graceful Fallback:** Uses dictionary words if AI generation fails
+- ✅ **Enhanced Logging:** Detailed pipeline visibility for debugging
 ### Challenge Mode
 - ✅ **Game ID Sharing:** Each puzzle generates a shareable link with `?game_id=<sid>` to challenge others with the same word list
 ## Development Status
+**Current Version:** 0.1.1 (Production Ready - AI Enhanced)
+### Completed ✅
+- **v0.1.1:** Enhanced AI word generation
+  - Intelligent word saving with duplicate prevention
+  - Automatic retry mechanism (up to 3 attempts)
+  - 1000-word file size limit
+  - Improved HF Space API integration
+  - Enhanced logging and error handling
+- **v0.1.0:** AI word generation foundation
+  - Topic-based word list creation
+  - Dual generation modes (HF Space + local)
+  - Utility modules integration
+- **v0.0.2:** All 7 sprints complete
+  - ✅ 100% test coverage (25/25 tests)
+  - 📊 Development time: ~12.75 hours (sprints 1-7)
+  - 📚 Complete documentation
 ### Future Roadmap
 - **v0.3.0:** Local persistent storage, high score tracking, player statistics

tests/test_word_file_validation.py ADDED Viewed

	@@ -0,0 +1,67 @@

+"""
+Test validation of word files for MIN_REQUIRED threshold compliance.
+"""
+import os
+import tempfile
+import shutil
+from wrdler.word_loader_ai import _save_ai_words_to_file
+from wrdler.word_loader import MIN_REQUIRED
+def test_save_ai_words_validates_min_required():
+    """Test that _save_ai_words_to_file returns insufficiency info."""
+    # Create a temporary directory for test files
+    test_dir = tempfile.mkdtemp()
+    try:
+        # Mock the words directory to point to our temp dir
+        import wrdler.word_loader_ai as wl_ai
+        original_dirname = wl_ai.os.path.dirname
+        def mock_dirname(path):
+            if "word_loader_ai.py" in path:
+                return test_dir
+            return original_dirname(path)
+        wl_ai.os.path.dirname = mock_dirname
+        # Test case 1: Insufficient words (should return non-empty dict)
+        insufficient_words = [
+            "COOK", "BAKE", "HEAT",  # 3 x 4-letter (need 25)
+            "ROAST", "GRILL", "STEAM",  # 3 x 5-letter (need 25)
+            "SIMMER", "BRAISE",  # 2 x 6-letter (need 25)
+        ]
+        filename, insufficient = _save_ai_words_to_file("test_topic", insufficient_words)
+        assert filename == "test_topic.txt", f"Expected 'test_topic.txt', got '{filename}'"
+        assert len(insufficient) > 0, "Expected insufficient_lengths to be non-empty"
+        assert 4 in insufficient, "Expected 4-letter words to be insufficient"
+        assert 5 in insufficient, "Expected 5-letter words to be insufficient"
+        assert 6 in insufficient, "Expected 6-letter words to be insufficient"
+        # Test case 2: Sufficient words (should return empty dict)
+        sufficient_words = []
+        for length in [4, 5, 6]:
+            for i in range(MIN_REQUIRED):
+                # Generate unique words of the required length
+                word = chr(65 + (i % 26)) * length + str(i).zfill(length - 1)
+                sufficient_words.append(word[:length].upper())
+        filename2, insufficient2 = _save_ai_words_to_file("test_sufficient", sufficient_words)
+        assert filename2 == "test_sufficient.txt", f"Expected 'test_sufficient.txt', got '{filename2}'"
+        assert len(insufficient2) == 0, f"Expected empty insufficient_lengths, got {insufficient2}"
+        print("? All validation tests passed!")
+    finally:
+        # Restore original dirname
+        wl_ai.os.path.dirname = original_dirname
+        # Clean up temp directory
+        shutil.rmtree(test_dir, ignore_errors=True)
+if __name__ == "__main__":
+    test_save_ai_words_validates_min_required()

wrdler/__init__.py CHANGED Viewed

@@ -8,5 +8,5 @@ Key differences from BattleWords:
 - 2 free letter guesses at game start
 """
-__version__ = "0.1.0"
 __all__ = ["models", "generator", "logic", "ui", "word_loader"]

 - 2 free letter guesses at game start
 """
+__version__ = "0.1.1"
 __all__ = ["models", "generator", "logic", "ui", "word_loader"]

wrdler/modules/__init__.py CHANGED Viewed

@@ -4,6 +4,11 @@ Shared utility modules for Wrdler.
 These modules are imported from the OpenBadge project and provide
 reusable functionality for storage, constants, and file utilities.
 """
 from .storage import (

 These modules are imported from the OpenBadge project and provide
 reusable functionality for storage, constants, and file utilities.
+The AI word generation system (word_loader_ai.py) uses these modules for:
+- File operations and path management (file_utils)
+- Storage configuration and HF integration (constants)
+- Remote storage and URL generation (storage)
 """
 from .storage import (

wrdler/word_loader_ai.py CHANGED Viewed

@@ -30,7 +30,8 @@ except Exception:  # pragma: no cover
 # Local imports
 from .word_loader import (
     load_word_list,
-    FALLBACK_WORDS,
     compute_word_difficulties,  # Use current v3 difficulty metric
 )
 from .modules.constants import AI_MODELS, TMPDIR, USE_HF_WORDS, HF_WORD_LIST_REPO_ID
@@ -59,7 +60,7 @@ _USED_MODEL_NAME: Optional[str] = None
 DEFAULT_MODEL_NAME = os.environ.get(
     "WRDLER_AI_MODEL",
-    AI_MODELS[0] if AI_MODELS else "meta-llama/Meta-Llama-3-8B-Instruct"
 )
 # Safety: limit max new tokens to keep latency reasonable (increased to accommodate 75+ words)
@@ -71,7 +72,7 @@ BASE_PROMPT_TEMPLATE = (
     "Return AT LEAST 75 UNIQUE WORDS related to the topic: '{topic}'.\n"
     "FORMAT RULES:\n"
     "- Output ONLY a single comma-separated list (no numbering, no extra text)\n"
-    "- Include at least: 25 words of length 4 letters, 25 words of length 5 letters, 25 words of length 6 letters\n"
     "- Use ONLY uppercase A-Z letters (no diacritics, hyphens, or spaces)\n"
     "- No duplicates. No explanations.\n"
     "List:"
@@ -79,7 +80,7 @@ BASE_PROMPT_TEMPLATE = (
 VALID_LENGTHS = (4, 5, 6)
 RE_WORD = re.compile(r"^[A-Z]+$")
-WORDS_PER_LENGTH = 25  # Target 25 words for each length
 # ---------------------------------------------------------------------------
@@ -101,9 +102,8 @@ def _generate_via_hf_space(topic: str) -> Tuple[str, str]:
     """
     if not _GRADIO_CLIENT_AVAILABLE:
         raise Exception("gradio_client not installed; install with: pip install gradio_client")
-    prompt = BASE_PROMPT_TEMPLATE.format(topic=topic.upper())
     try:
         logger.info(f"🌐 Calling HF Space API: {HF_WORD_LIST_REPO_ID}")
         client = Client(HF_WORD_LIST_REPO_ID)
@@ -131,6 +131,7 @@ def _generate_via_hf_space(topic: str) -> Tuple[str, str]:
 def _load_model(model_name: str = DEFAULT_MODEL_NAME):
     """
     Try to load the requested model first, then fall back through AI_MODELS in order.
     """
     if not _TRANSFORMERS_AVAILABLE:
         logger.warning("⚠️ Transformers not available; falling back to dictionary words.")
@@ -154,12 +155,25 @@ def _load_model(model_name: str = DEFAULT_MODEL_NAME):
                 torch_dtype="auto",
                 device_map="auto" if device == 0 else None,
             )
-            gen = pipeline(
-                "text-generation",
-                model=model,
-                tokenizer=tokenizer,
-                device=device,
-            )
             global _USED_MODEL_NAME
             _USED_MODEL_NAME = current
             logger.info(f"✅ Model loaded successfully: {current}")
@@ -197,7 +211,7 @@ def _extract_words_from_output(prompt: str, raw_output: str) -> List[str]:
 def _enforce_distribution(words: List[str], wordlist_map: Dict[int, List[str]]) -> List[str]:
     """
-    Ensure we have exactly 25 of each required length (4,5,6). Truncate extras.
     Missing slots are filled from dictionary words (wordlist_map), then FALLBACK_WORDS if needed.
     Args:
@@ -205,7 +219,7 @@ def _enforce_distribution(words: List[str], wordlist_map: Dict[int, List[str]])
         wordlist_map: Dictionary of canonical words by length from load_word_list
     Returns:
-        List of exactly 75 words (25 each of lengths 4, 5, 6)
     """
     by_len: Dict[int, List[str]] = {4: [], 5: [], 6: []}
     for w in words:
@@ -213,10 +227,6 @@ def _enforce_distribution(words: List[str], wordlist_map: Dict[int, List[str]])
         if L in by_len and w not in by_len[L]:
             by_len[L].append(w)
-    # Trim to at most 25 each
-    for L in VALID_LENGTHS:
-        by_len[L] = by_len[L][:WORDS_PER_LENGTH]
     # Fill missing using dictionary words, then fallback words if still needed
     for L in VALID_LENGTHS:
         if len(by_len[L]) < WORDS_PER_LENGTH:
@@ -261,6 +271,89 @@ def _filter_and_dedupe(words: List[str]) -> List[str]:
     return result
 def _score_words(full_wordlist_path: Optional[str], words: List[str]) -> Dict[str, float]:
     """
     Use existing difficulty metric for the subset derived.
@@ -276,7 +369,7 @@ def _score_words(full_wordlist_path: Optional[str], words: List[str]) -> Dict[st
         return {}
-def _save_ai_words_to_file(topic: str, words: List[str]) -> str:
     """
     Save AI-generated words to a file in the words folder.
     If the file exists, append new words without duplicates and sort.
@@ -285,9 +378,11 @@ def _save_ai_words_to_file(topic: str, words: List[str]) -> str:
     Args:
         topic: The topic used for generation
         words: List of words to save
     Returns:
-        The filename of the saved file
     """
     from .generator import sort_word_file
@@ -314,7 +409,7 @@ def _save_ai_words_to_file(topic: str, words: List[str]) -> str:
             # Check if file already has 1000+ words
             if len(existing_words) >= 1000:
                 logger.info(f"ℹ️ File {filename} already has {len(existing_words)} words (≥1000). Not adding new words.")
-                return filename
         except Exception as e:
             logger.warning(f"⚠️ Error reading existing file {filename}: {e}")
@@ -357,15 +452,55 @@ def _save_ai_words_to_file(topic: str, words: List[str]) -> str:
                 for word in sorted_words:
                     f.write(f"{word}\n")
             logger.info(f"✅ Successfully saved and sorted {len(all_words)} words to {filename}")
-            return filename
         except Exception as e:
             logger.error(f"❌ Error saving words to {filename}: {e}")
-            return ""
     else:
         logger.info(f"ℹ️ No new words to add to {filename}")
-        return filename
 # ---------------------------------------------------------------------------
@@ -385,7 +520,7 @@ def generate_ai_words(
     Returns:
         words: List[str] - Final 75 words (uppercase A–Z).
-        difficulties: Dict[str,float] - Difficulty scores using compute_word_difficulties().
         metadata: Dict[str,str] - Source / diagnostic info.
     Parameters:
@@ -407,13 +542,14 @@ def generate_ai_words(
     raw_generated_text = ""
     ai_words: List[str] = []
     generation_source = "none"
     # Check if USE_HF_WORDS is enabled
     if USE_HF_WORDS:
         # Try HF Space API first
         try:
             raw_generated_text, generation_source = _generate_via_hf_space(topic)
-            prompt = BASE_PROMPT_TEMPLATE.format(topic=topic.upper())
             parsed = _extract_words_from_output(prompt, raw_generated_text)
             logger.debug(f"Parsed {len(parsed)} words from HF Space output")
             ai_words = _filter_and_dedupe(parsed)
@@ -432,7 +568,7 @@ def generate_ai_words(
         generator = _load_model(model_name or DEFAULT_MODEL_NAME)
         if generator is not None:
-            prompt = BASE_PROMPT_TEMPLATE.format(topic=topic.upper())
             try:
                 logger.info(f"📝 Generating words from local AI model...")
                 outputs = generator(
@@ -462,7 +598,7 @@ def generate_ai_words(
             ai_words = []
     # CORRECT ORDER:
-    # 1. FIRST identify and save new words (before any filtering)
     new_words_to_save: List[str] = []
     if ai_words:
         existing_words = [w for w in ai_words if w in canonical_set]
@@ -470,13 +606,86 @@ def generate_ai_words(
         logger.info(f"📊 Word analysis: {len(ai_words)} total = {len(existing_words)} existing + {len(new_words_to_save)} NEW")
-        # Save the NEW words to expand the dictionary
         if new_words_to_save:
-            saved_filename = _save_ai_words_to_file(topic, new_words_to_save)
-            if saved_filename:
-                logger.info(f"💾 Saved {len(new_words_to_save)} NEW words to {saved_filename}")
-    # 2. THEN apply dictionary filter if requested (for game word selection)
     if use_dictionary_filter and ai_words:
         before_filter = len(ai_words)
         filtered_out_words = [w for w in ai_words if w not in canonical_set]
@@ -494,7 +703,7 @@ def generate_ai_words(
                 else:
                     by_len['other'].append(w)
-            logger.info(f"🚫 Filtered out words NOT in dictionary:")
             for length in [4, 5, 6, 'other']:
                 if by_len[length]:
                     logger.info(f"   {length}-letter: {', '.join(sorted(by_len[length]))}")

 # Local imports
 from .word_loader import (
     load_word_list,
+    FALLBACK_WORDS,
+    MIN_REQUIRED,
     compute_word_difficulties,  # Use current v3 difficulty metric
 )
 from .modules.constants import AI_MODELS, TMPDIR, USE_HF_WORDS, HF_WORD_LIST_REPO_ID
 DEFAULT_MODEL_NAME = os.environ.get(
     "WRDLER_AI_MODEL",
+    AI_MODELS[0] if AI_MODELS else "meta-llama/Llama-3.1-8B-Instruct"
 )
 # Safety: limit max new tokens to keep latency reasonable (increased to accommodate 75+ words)
     "Return AT LEAST 75 UNIQUE WORDS related to the topic: '{topic}'.\n"
     "FORMAT RULES:\n"
     "- Output ONLY a single comma-separated list (no numbering, no extra text)\n"
+    "- Include at least: {WORDS_PER_LENGTH} words of length 4 letters, {WORDS_PER_LENGTH} words of length 5 letters, {WORDS_PER_LENGTH} words of length 6 letters\n"
     "- Use ONLY uppercase A-Z letters (no diacritics, hyphens, or spaces)\n"
     "- No duplicates. No explanations.\n"
     "List:"
 VALID_LENGTHS = (4, 5, 6)
 RE_WORD = re.compile(r"^[A-Z]+$")
+WORDS_PER_LENGTH = MIN_REQUIRED  # Target MIN_REQUIRED words for each length
 # ---------------------------------------------------------------------------
     """
     if not _GRADIO_CLIENT_AVAILABLE:
         raise Exception("gradio_client not installed; install with: pip install gradio_client")
+    prompt = BASE_PROMPT_TEMPLATE.format(topic=topic.upper(), WORDS_PER_LENGTH=WORDS_PER_LENGTH)
     try:
         logger.info(f"🌐 Calling HF Space API: {HF_WORD_LIST_REPO_ID}")
         client = Client(HF_WORD_LIST_REPO_ID)
 def _load_model(model_name: str = DEFAULT_MODEL_NAME):
     """
     Try to load the requested model first, then fall back through AI_MODELS in order.
+    Detect error 422 or 'cannot be moved to a specific device' and retry pipeline without device argument.
     """
     if not _TRANSFORMERS_AVAILABLE:
         logger.warning("⚠️ Transformers not available; falling back to dictionary words.")
                 torch_dtype="auto",
                 device_map="auto" if device == 0 else None,
             )
+            try:
+                gen = pipeline(
+                    "text-generation",
+                    model=model,
+                    tokenizer=tokenizer,
+                    device=device,
+                )
+            except Exception as e:
+                # Detect error 422 or accelerate device error
+                msg = str(e)
+                if "cannot be moved to a specific device" in msg or "422" in msg:
+                    logger.warning(f"⚠️ Retrying pipeline for {current} without device argument due to error: {msg}")
+                    gen = pipeline(
+                        "text-generation",
+                        model=model,
+                        tokenizer=tokenizer,
+                    )
+                else:
+                    raise
             global _USED_MODEL_NAME
             _USED_MODEL_NAME = current
             logger.info(f"✅ Model loaded successfully: {current}")
 def _enforce_distribution(words: List[str], wordlist_map: Dict[int, List[str]]) -> List[str]:
     """
+    Ensure we have at least MIN_REQUIRED (25) of each required length (4,5,6).
     Missing slots are filled from dictionary words (wordlist_map), then FALLBACK_WORDS if needed.
     Args:
         wordlist_map: Dictionary of canonical words by length from load_word_list
     Returns:
+        List of minimum 75 words (25 each of lengths 4, 5, 6)
     """
     by_len: Dict[int, List[str]] = {4: [], 5: [], 6: []}
     for w in words:
         if L in by_len and w not in by_len[L]:
             by_len[L].append(w)
     # Fill missing using dictionary words, then fallback words if still needed
     for L in VALID_LENGTHS:
         if len(by_len[L]) < WORDS_PER_LENGTH:
     return result
+def _generate_additional_words(
+    topic: str,
+    needed_by_length: Dict[int, int],
+    existing_words: set,
+    generator,
+    generation_source: str
+) -> List[str]:
+    """
+    Generate additional words when initial generation didn't produce enough new words.
+    Args:
+        topic: The topic for generation
+        needed_by_length: Dict mapping length to number of words needed
+        existing_words: Set of words that already exist (to avoid duplicates)
+        generator: The AI model generator (None if using HF Space)
+        generation_source: Source identifier for logging
+    Returns:
+        List of newly generated words
+    """
+    # Build targeted prompt requesting specific quantities
+    total_needed = sum(needed_by_length.values())
+    if total_needed == 0:
+        return []
+    length_requirements = ", ".join([
+        f"{count} words of length {length} letters"
+        for length, count in needed_by_length.items()
+        if count > 0
+    ])
+    targeted_prompt = (
+        f"You are an assistant generating words for a word deduction game.\n"
+        f"Generate AT LEAST {total_needed} MORE UNIQUE WORDS related to the topic: '{topic.upper()}'.\n"
+        f"FORMAT RULES:\n"
+        f"- Output ONLY a single comma-separated list (no numbering, no extra text)\n"
+        f"- Include AT LEAST: {length_requirements}\n"
+        f"- Use ONLY uppercase A-Z letters (no diacritics, hyphens, or spaces)\n"
+        f"- No duplicates. No explanations.\n"
+        f"List:"
+    )
+    logger.info(f"📝 Generating {total_needed} additional words: {length_requirements}")
+    additional_words: List[str] = []
+    try:
+        if generation_source == HF_WORD_LIST_REPO_ID and _GRADIO_CLIENT_AVAILABLE:
+            # Use HF Space API
+            client = Client(HF_WORD_LIST_REPO_ID)
+            result = client.predict(
+                message=targeted_prompt,
+                temperature=0.95,
+                max_new_tokens=MAX_NEW_TOKENS,
+                api_name="/chat"
+            )
+            parsed = _extract_words_from_output(targeted_prompt, result)
+            additional_words = _filter_and_dedupe(parsed)
+        elif generator is not None:
+            # Use local model
+            outputs = generator(
+                targeted_prompt,
+                max_new_tokens=MAX_NEW_TOKENS,
+                num_return_sequences=1,
+                temperature=0.8,
+                do_sample=True,
+            )
+            raw_output = outputs[0]["generated_text"]
+            parsed = _extract_words_from_output(targeted_prompt, raw_output)
+            additional_words = _filter_and_dedupe(parsed)
+        # Filter out words that already exist
+        additional_words = [w for w in additional_words if w not in existing_words]
+        logger.info(f"✅ Generated {len(additional_words)} additional unique words")
+        return additional_words
+    except Exception as e:
+        logger.error(f"❌ Failed to generate additional words: {e}")
+        return []
 def _score_words(full_wordlist_path: Optional[str], words: List[str]) -> Dict[str, float]:
     """
     Use existing difficulty metric for the subset derived.
         return {}
+def _save_ai_words_to_file(topic: str, words: List[str]) -> Tuple[str, Dict[int, int]]:
     """
     Save AI-generated words to a file in the words folder.
     If the file exists, append new words without duplicates and sort.
     Args:
         topic: The topic used for generation
         words: List of words to save
     Returns:
+        Tuple of (filename, insufficient_lengths) where:
+        - filename: The filename of the saved file (empty string on error)
+        - insufficient_lengths: Dict mapping length -> shortfall count (empty if all lengths meet MIN_REQUIRED)
     """
     from .generator import sort_word_file
             # Check if file already has 1000+ words
             if len(existing_words) >= 1000:
                 logger.info(f"ℹ️ File {filename} already has {len(existing_words)} words (≥1000). Not adding new words.")
+                return filename, {}  # Return empty dict = no insufficiency
         except Exception as e:
             logger.warning(f"⚠️ Error reading existing file {filename}: {e}")
                 for word in sorted_words:
                     f.write(f"{word}\n")
+            # Validate file now has MIN_REQUIRED words per length
+            words_by_len = {4: [], 5: [], 6: []}
+            for w in sorted_words:
+                L = len(w)
+                if L in words_by_len:
+                    words_by_len[L].append(w)
+            insufficient_lengths = {L: MIN_REQUIRED - len(words_by_len[L])
+                                   for L in (4, 5, 6)
+                                   if len(words_by_len[L]) < MIN_REQUIRED}
+            if insufficient_lengths:
+                logger.warning(
+                    f"⚠️ File {filename} still below MIN_REQUIRED threshold: "
+                    f"{', '.join(f'{L}-letter: {len(words_by_len[L])}/{MIN_REQUIRED}' for L in insufficient_lengths.keys())}"
+                )
+            else:
+                logger.info(f"✅ File {filename} meets MIN_REQUIRED threshold for all lengths")
             logger.info(f"✅ Successfully saved and sorted {len(all_words)} words to {filename}")
+            return filename, insufficient_lengths
         except Exception as e:
             logger.error(f"❌ Error saving words to {filename}: {e}")
+            return "", {}
     else:
         logger.info(f"ℹ️ No new words to add to {filename}")
+        # Still validate existing file
+        try:
+            sorted_words = sort_word_file(filepath)
+            words_by_len = {4: [], 5: [], 6: []}
+            for w in sorted_words:
+                L = len(w)
+                if L in words_by_len:
+                    words_by_len[L].append(w)
+            insufficient_lengths = {L: MIN_REQUIRED - len(words_by_len[L])
+                                   for L in (4, 5, 6)
+                                   if len(words_by_len[L]) < MIN_REQUIRED}
+            if insufficient_lengths:
+                logger.warning(
+                    f"⚠️ File {filename} still below MIN_REQUIRED threshold: "
+                    f"{', '.join(f'{L}-letter: {len(words_by_len[L])}/{MIN_REQUIRED}' for L in insufficient_lengths.keys())}"
+                )
+            return filename, insufficient_lengths
+        except Exception:
+            return filename, {}
 # ---------------------------------------------------------------------------
     Returns:
         words: List[str] - Final 75 words (uppercase A–Z).
+        difficulties: Dict[str,float] - Difficulty scores using compute_word_difficulty().
         metadata: Dict[str,str] - Source / diagnostic info.
     Parameters:
     raw_generated_text = ""
     ai_words: List[str] = []
     generation_source = "none"
+    generator = None  # Track generator for potential additional generation
     # Check if USE_HF_WORDS is enabled
     if USE_HF_WORDS:
         # Try HF Space API first
         try:
             raw_generated_text, generation_source = _generate_via_hf_space(topic)
+            prompt = BASE_PROMPT_TEMPLATE.format(topic=topic.upper(), WORDS_PER_LENGTH=WORDS_PER_LENGTH)
             parsed = _extract_words_from_output(prompt, raw_generated_text)
             logger.debug(f"Parsed {len(parsed)} words from HF Space output")
             ai_words = _filter_and_dedupe(parsed)
         generator = _load_model(model_name or DEFAULT_MODEL_NAME)
         if generator is not None:
+            prompt = BASE_PROMPT_TEMPLATE.format(topic=topic.upper(), WORDS_PER_LENGTH=WORDS_PER_LENGTH)
             try:
                 logger.info(f"📝 Generating words from local AI model...")
                 outputs = generator(
             ai_words = []
     # CORRECT ORDER:
+    # 1. FIRST identify new words (before any filtering)
     new_words_to_save: List[str] = []
     if ai_words:
         existing_words = [w for w in ai_words if w in canonical_set]
         logger.info(f"📊 Word analysis: {len(ai_words)} total = {len(existing_words)} existing + {len(new_words_to_save)} NEW")
+        # 2. Check if we have MIN_REQUIRED new words for each length
+        new_words_by_length = {4: [], 5: [], 6: []}
+        for w in new_words_to_save:
+            L = len(w)
+            if L in new_words_by_length:
+                new_words_by_length[L].append(w)
+        # Calculate how many more words we need per length
+        needed_by_length = {}
+        for L in VALID_LENGTHS:
+            current_count = len(new_words_by_length[L])
+            if current_count < MIN_REQUIRED:
+                needed_by_length[L] = MIN_REQUIRED - current_count
+                logger.info(f"⚠️ Only {current_count}/{MIN_REQUIRED} new {L}-letter words. Need {needed_by_length[L]} more.")
+        # 3. If we need more words, generate them
+        if needed_by_length:
+            logger.info(f"🔄 Attempting to generate additional words to meet MIN_REQUIRED threshold...")
+            all_existing = canonical_set.union(set(new_words_to_save))
+            additional_words = _generate_additional_words(
+                topic=topic,
+                needed_by_length=needed_by_length,
+                existing_words=all_existing,
+                generator=generator,
+                generation_source=generation_source
+            )
+            if additional_words:
+                # Add additional words to new_words_to_save and ai_words
+                new_words_to_save.extend(additional_words)
+                ai_words.extend(additional_words)
+                # Update counts
+                for w in additional_words:
+                    L = len(w)
+                    if L in new_words_by_length:
+                        new_words_by_length[L].append(w)
+                logger.info(f"✅ Added {len(additional_words)} additional words. New totals:")
+                for L in VALID_LENGTHS:
+                    logger.info(f"   {L}-letter: {len(new_words_by_length[L])} new words")
+            else:
+                logger.warning(f"⚠️ Could not generate additional words. Proceeding with current set.")
+        # 4. Save the NEW words to expand the dictionary
         if new_words_to_save:
+            max_save_retries = 3
+            retry_count = 0
+            while retry_count < max_save_retries:
+                saved_filename, insufficient_lengths = _save_ai_words_to_file(topic, new_words_to_save)
+                if saved_filename:
+                    logger.info(f"💾 Saved {len(new_words_to_save)} NEW words to {saved_filename}")
+                # If file meets MIN_REQUIRED or we've exhausted retries, break
+                if not insufficient_lengths or retry_count >= max_save_retries - 1:
+                    break
+                # File still insufficient - generate more words for the missing lengths
+                logger.info(f"🔄 File {saved_filename} needs more words. Retry {retry_count + 1}/{max_save_retries}")
+                # Generate additional words to fill the gap
+                additional_fill_words = _generate_additional_words(
+                    topic=topic,
+                    needed_by_length=insufficient_lengths,
+                    existing_words=canonical_set.union(set(new_words_to_save)),
+                    generator=generator,
+                    generation_source=generation_source
+                )
+                if additional_fill_words:
+                    logger.info(f"✅ Generated {len(additional_fill_words)} words to fill file gap")
+                    new_words_to_save.extend(additional_fill_words)
+                    retry_count += 1
+                else:
+                    logger.warning(f"⚠️ Could not generate additional words to fill file. Stopping retries.")
+                    break
+    # 5. THEN apply dictionary filter if requested (for game word selection)
     if use_dictionary_filter and ai_words:
         before_filter = len(ai_words)
         filtered_out_words = [w for w in ai_words if w not in canonical_set]
                 else:
                     by_len['other'].append(w)
+            logger.info(f"🚫 Filtered out words ALREADY in dictionary:")
             for length in [4, 5, 6, 'other']:
                 if by_len[length]:
                     logger.info(f"   {length}-letter: {', '.join(sorted(by_len[length]))}")

wrdler/words/cooking.txt ADDED Viewed

	@@ -0,0 +1,180 @@

+# AI-generated word list
+# Topic: COOKing
+# Last updated: 2025-11-28 12:16:45
+# Total words: 174
+# Format: one word per line, sorted by length then alphabetically
+#
+BAKE
+BEEN
+BLUE
+BOIL
+BUFF
+CAKE
+CALV
+CARD
+COOK
+FARM
+FAST
+FILL
+FIRE
+FISH
+FIZZ
+FOOD
+FRYD
+HARV
+HERB
+JUST
+KEEN
+KERN
+KING
+MAKE
+MANU
+MASH
+MEAT
+MINI
+PICK
+PLAN
+POTS
+POUR
+PREP
+PUFF
+RACK
+RECT
+ROLL
+SEAR
+SEEK
+SIFT
+SIMP
+SKIL
+SKIP
+SLOW
+SOFT
+SOUP
+STEW
+STIR
+TAKE
+THAW
+TURK
+VEAL
+WARM
+WASH
+WELL
+WHIP
+WRAP
+BACON
+BAKED
+BASTE
+BLANK
+BLEND
+BLTIG
+BOATS
+BOILD
+BREAD
+BROWN
+BUTTI
+CANDY
+CARBS
+CHICK
+CHILL
+CLEAN
+CRUMB
+CRUST
+DRIED
+FLAKY
+FRIED
+FRUIT
+FRYER
+GLAZE
+GRAIN
+GRASS
+GRILL
+HARVE
+HERBS
+HOUSE
+KITCH
+KNEAD
+MARIN
+MIXED
+MIXER
+PASTA
+PLATE
+PREPS
+PUREE
+QUICK
+ROAST
+RUBED
+SALAD
+SALTY
+SAUTE
+SCONE
+SHAVE
+SHRIL
+SLICE
+SLICK
+SMAKE
+SPICE
+SPICY
+SPIRE
+START
+STEAK
+STICK
+STIRD
+STIRP
+STOVE
+SWEET
+TASTE
+TASTY
+TEMPP
+THINK
+TOAST
+TREAT
+TRIMM
+TWIST
+UNDER
+VEGET
+WHISK
+WIELD
+YIELD
+ZESTY
+ASSERT
+BARELY
+BARKED
+BAROBA
+BATTER
+BAUGHT
+BAZAAR
+BITTER
+BOILED
+BROWNY
+CARROT
+CHEESE
+COOKED
+COOKER
+COOKIN
+DETAIL
+DILUTE
+EATING
+FLOURD
+FLYING
+FROZEN
+GRATED
+GRILLD
+KNEADD
+LITTLE
+MARINA
+MASHED
+METHOD
+MUFFIN
+NOODLE
+PACKED
+PICKED
+PIERCE
+RECIPE
+SHIELD
+SMELLY
+SMOOTH
+SNAKES
+SPICED
+TASTES
+TOUCHS
+VEGGIE

wrdler/words/english.txt CHANGED Viewed

@@ -1,7 +1,7 @@
 # AI-generated word list
-# Topic: EnglisH
-# Last updated: 2025-11-24 09:23:12
-# Total words: 336
 # Format: one word per line, sorted by length then alphabetically
 #
 ALMS
@@ -15,6 +15,7 @@ BASE
 BEAN
 BELL
 BEND
 BORE
 BORN
 BUNK
@@ -49,9 +50,11 @@ GIRD
 GIVE
 GLOB
 HALE
 HONE
 HOSE
 IRON
 LAKE
 LAND
 LANE
@@ -70,7 +73,9 @@ MINE
 NAIL
 OILY
 PACE
 PAIN
 POET
 PORE
 PORT
@@ -92,9 +97,11 @@ SLIT
 SLUG
 SORE
 STEM
 TAGS
 TAIL
 TALE
 TIME
 TYPE
 WALK
@@ -104,6 +111,7 @@ WISH
 WORD
 YELL
 ZONE
 ALICE
 ALIKE
 BANKS
@@ -111,6 +119,7 @@ BEAST
 BLANK
 BLIND
 BOAST
 BRAID
 BRAKE
 BRASH
@@ -207,6 +216,7 @@ LOYAL
 LUNCH
 LURID
 LURKY
 MAIZE
 MANTO
 MAPLE
@@ -224,6 +234,7 @@ NODES
 NURSE
 OMITS
 OUTGO
 PEERS
 PINKO
 PINTS
@@ -320,9 +331,13 @@ STAIR
 STALK
 STICK
 SUNNY
 VOICE
 WAIST
 WASTE
 AUTHOR
 BEACON
 BRIDGE
@@ -336,7 +351,13 @@ GENTLE
 HAMMER
 ISLAND
 LENGTH
 PRAISE
 REASON
 REFORM
 REYLES

 # AI-generated word list
+# Topic: English
+# Last updated: 2025-11-28 10:56:13
+# Total words: 357
 # Format: one word per line, sorted by length then alphabetically
 #
 ALMS
 BEAN
 BELL
 BEND
+BOOK
 BORE
 BORN
 BUNK
 GIVE
 GLOB
 HALE
+HELP
 HONE
 HOSE
 IRON
+JUMP
 LAKE
 LAND
 LANE
 NAIL
 OILY
 PACE
+PAGE
 PAIN
+PENS
 POET
 PORE
 PORT
 SLUG
 SORE
 STEM
+STEP
 TAGS
 TAIL
 TALE
+TEXT
 TIME
 TYPE
 WALK
 WORD
 YELL
 ZONE
+ALIAS
 ALICE
 ALIKE
 BANKS
 BLANK
 BLIND
 BOAST
+BOOKS
 BRAID
 BRAKE
 BRASH
 LUNCH
 LURID
 LURKY
+MACRO
 MAIZE
 MANTO
 MAPLE
 NURSE
 OMITS
 OUTGO
+PAGES
 PEERS
 PINKO
 PINTS
 STALK
 STICK
 SUNNY
+TUTOR
 VOICE
 WAIST
 WASTE
+WORDS
+WRITE
+ASSIST
 AUTHOR
 BEACON
 BRIDGE
 HAMMER
 ISLAND
 LENGTH
+LETTER
+PHRASE
 PRAISE
 REASON
 REFORM
 REYLES
+SCRIPT
+SPRINT
+SURVEY
+SYMBOL

wrdler/words/wordlist.txt CHANGED Viewed

@@ -1,5 +1,9 @@
-# Optional: place a large A–Z word list here (one word per line).
-# The app falls back to built-in pools if fewer than 500 words per length are found.
 ABLE
 ACID
 AGED
@@ -268,7 +272,6 @@ MISS
 MODE
 MOOD
 MOON
-MOON
 MORE
 MOST
 MOVE
@@ -287,6 +290,7 @@ NICK
 NINE
 NOSE
 NOTE
 OBEY
 ODDS
 OILY
@@ -481,7 +485,6 @@ WIFE
 WILD
 WILL
 WIND
-WIND
 WINE
 WING
 WIRE
@@ -498,6 +501,7 @@ YELL
 YOGA
 ZERO
 ZONE
 APPLE
 BLAST
 BOARD
@@ -506,40 +510,50 @@ BREAD
 CHAIR
 CHALK
 CHESS
 CLOUD
 CRANE
 DANCE
 EARTH
 FAITH
 FLAME
 FLUTE
 GHOST
 GRAPE
 GRASS
 GREAT
 HEART
-HEART
 LEMON
 LIGHT
 MARCH
 MOUSE
 NURSE
 PANEL
-PANEL
 PLANT
 PRIZE
 QUEST
 RIVER
 SCALE
 SHINE
 SMILE
 STONE
 TIGER
 YOUNG
 BUNDLE
 CANDLE
 CHERRY
 CIRCLE
 DOCTOR
 DOMAIN
 FAMILY
@@ -553,13 +567,20 @@ LADDER
 LAUNCH
 LOGGER
 MARKET
 MOTHER
 ORANGE
 PALACE
 POCKET
 SILVER
 SPIRIT
 STREAM
 THRIVE
 TUNNEL
 WINNER

+# AI-generated word list
+# Topic: wordlisT
+# Last updated: 2025-11-28 12:18:24
+# Total words: 581
+# Format: one word per line, sorted by length then alphabetically
+#
 ABLE
 ACID
 AGED
 MODE
 MOOD
 MOON
 MORE
 MOST
 MOVE
 NINE
 NOSE
 NOTE
+NUNC
 OBEY
 ODDS
 OILY
 WILD
 WILL
 WIND
 WINE
 WING
 WIRE
 YOGA
 ZERO
 ZONE
+ADJAR
 APPLE
 BLAST
 BOARD
 CHAIR
 CHALK
 CHESS
+CLASS
 CLOUD
 CRANE
+CYCLE
 DANCE
 EARTH
 FAITH
+FINAL
 FLAME
 FLUTE
+FORUM
 GHOST
 GRAPE
 GRASS
 GREAT
 HEART
+INDEX
+LABEL
 LEMON
 LIGHT
 MARCH
 MOUSE
 NURSE
 PANEL
 PLANT
 PRIZE
 QUEST
+RATIO
 RIVER
 SCALE
 SHINE
 SMILE
 STONE
 TIGER
+VERBS
+VOICE
 YOUNG
+AUTHOR
 BUNDLE
 CANDLE
 CHERRY
 CIRCLE
+DEVICE
+DIGITS
 DOCTOR
 DOMAIN
 FAMILY
 LAUNCH
 LOGGER
 MARKET
+METHOD
 MOTHER
+OFFSET
 ORANGE
+ORIGIN
 PALACE
+PHRASE
 POCKET
+SENSOR
 SILVER
 SPIRIT
 STREAM
+SYMBOL
+SYSTEM
 THRIVE
 TUNNEL
 WINNER