Spaces:
Runtime error
Runtime error
| title: LongCePO Chatbot (Sambanova) | |
| emoji: 🤖 | |
| colorFrom: blue | |
| colorTo: green | |
| sdk: gradio | |
| sdk_version: 5.27.1 | |
| app_file: app.py | |
| pinned: false | |
| # LongCePO Chatbot with Sambanova Backend | |
| This is a simple chatbot interface demonstrating the LongCePO (Long-Context Planning and Optimization) method using a Sambanova model (`Llama-4-Maverick-17B-128E-Instruct`) as the backend LLM. | |
| ## How it works | |
| The LongCePO method is designed to handle long contexts (potentially millions of tokens) by: | |
| 1. **Planning:** Decomposing the initial query into sub-questions. | |
| 2. **MapReduce:** Answering each sub-question by processing chunks of the long context, summarizing relevant information, and aggregating results. | |
| This application takes a long text context and a query based on that context. It then uses the modified `longcepo` plugin (originally from the `optillm` repository) to generate an answer using the Sambanova API. | |
| ## How to use | |
| 1. **(Optional)** Enter a system prompt to guide the chatbot's behavior. | |
| 2. Paste the long text document into the **Context** box. | |
| 3. Enter your question based on the provided context into the **Query** box. | |
| 4. Click **Submit**. | |
| The chatbot will process the request using the LongCePO pipeline and display the final answer. | |
| **Note:** Processing long contexts can take some time depending on the length of the context and the complexity of the query. | |
| ## API Key | |
| This application requires a Sambanova API key to function. The key should be stored as a Hugging Face Space Secret named `SAMBANOVA_API_KEY`. | |