Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Question, Can I give openAI some data for it to process so I can use it to my own advantage. Say I want to train it on specific topic of information I've gathered over the years. Can I some how give it that data and then I can use the API to get back out data in a chat or some other forms of questions?

I'm not too familiar with how it works.



You can do this to an extent via fine-tuning, but you will need to do so via one of the other GPT-3 models rather than the ChatGPT API model (`GPT-3.5-turbo`). The latter is not available for fine-tuning.


I wonder if it would be possible to simply fine tune another GPT-3 model, have ChatGPT form the query based on input into ChatGPT's API, then pass that response into a call to the GPT-3 model, and take the response from that, pass it into ChatGPT, and ask for it to parse it and make sense of it.


You could use the embeddings to search for a relevant article or doc/docs then feed those in as input to chatgpt, followed by your question.


Does openAI have models for data to train it with.. data like pricing, locations, products .... Or would I have to use something else? The reason I ask because companies are using it with their own data, like Shopify... sooo it has to be trained somehow.


The docs on fine tuning are excellent: https://platform.openai.com/docs/guides/fine-tuning


Yes, you bring your own data, or you generate data with the model, that works too. I have generated training data with GPT3 and trained smaller HuggingFace models on it.


Any links on how to get started with this?


Depends on what task you have, but look at sbert.net for the student model. Generating data with GPT3 means solving the prompt, also task specific.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: