Classic OpenAI
don't worry, you can still use this to demolish privacy! just think, if you had a bunch of unstructured data but wanted to selectively peek at all the spicy bits....
That's not just smart, that's genius.
you're absolutely right!
Worth it lmao
Why was the only normal discussion here closed?
By the way, some guy wrote code to use this model not for filtering, but for searching for confidential information. Link
Also, this is just an ordinary classifier...
Why does it have more likes than DeepSeek V4 Flash? Are you actually going to use this model? Does it filter medical data? Legal data? Or anything else at all?
Let’s imagine I want to publish a dataset of my chats (many people in the OSS community praise the model for exactly this use, calling it a "gift").
But this gift only works in English... Well, some gift.
I won't even mention that anyone can train this model at home.
Datasets for this exist: https://huggingface.co/datasets?sort=trending&search=PII
Models for this already exist: https://huggingface.co/models?sort=trending&search=PII
Nvidia's model is more powerful and even multilingual, and it was originally trained as a classifier. You can try it here.
But OpenAI got the most attention, and I think in this case it's purely because of the name. If they wanted to release a model so that people could publish their chats from different companies and conceal their personal data, they would have stated that and encouraged people to do so. But in reality, this is just one of many OpenAI models that were sitting in their archive.
Maybe just delete this discussion altogether then, since it’s inconvenient for you?
Nah lol, it can stay

