so how many hours for that dataset? Price is important.
Jean Louis
JLouisBiz
·
AI & ML interests
- LLM for sales, marketing, promotion
- LLM for Website Revision System
- increasing quality of communication with customers
- helping clients access information faster
- saving people from financial troubles
Recent Activity
replied to EricFillion's
post about 15 hours ago
Here's how to perform full-parameter fine-tuning for https://huggingface.co/openai/gpt-oss-20b with a single H200 GPU. Only a few lines of code are needed using my new Python package called Eric Transformer.
Article: https://www.vennify.ai/gpt-oss-20b-fine-tune/
@vennify
```
pip install erictransformer
```
```
from erictransformer import EricChat, EricTrainArgs
eric_chat = EricChat(model_name="openai/gpt-oss-20b")
train_args = EricTrainArgs(optim="sgd")
# see the article to learn how to format the data
eric_chat.train("train.jsonl", eval_path="eval.jsonl", args=train_args)
``` replied to EricFillion's
post 1 day ago
Here’s how to perform retrieval-augmented (RAG) with two new open-source Python packages I just released. I included a full article below that provides a step-by-step guide on how to build a vector database with this https://huggingface.co/datasets/wikimedia/wikipedia dump and use it to perform RAG with https://huggingface.co/openai/gpt-oss-20b.
FULL ARTICLE: https://www.vennify.ai/vector-eric-search/
https://huggingface.co/vennify
```
pip install erictransformer ericsearch
```
```
import json
from ericsearch import EricSearch
from erictransformer import EricChat
eric_search = EricSearch()
with open("data.jsonl", "w", encoding="utf-8") as f:
sample_case = {"text": "This contains example data. It should contain at least two sentences."}
f.write(json.dumps(sample_case)+ "\n")
eric_search.train("data.jsonl")
eric_search = EricSearch(data_name="eric_search/")
eric_chat = EricChat(model_name="openai/gpt-oss-20b", eric_search=eric_search)
result = eric_chat("Tell me about artificial intelligence ")
print(result.text)
```