스프링부트에서 자바로 GPT 파인 튜닝하기

지난번에는 스프링 부트에서 GPT API를 연동하고 질문을 던지고 GPT 답변을 받는 방법을 알아보았다.

이번에는 파인 튜닝 데이터를 활용해 특정 작업에 맞춰 조정된 GPT 모델을 만드는 방법을 알아보겠다.

일단 파인튜닝이 무엇인가에 대해 간단히 알아보자.

Fine-tuning은 '미세 조정'이라는 뜻으로 딥 러닝에서 사전 훈련된 모델을 새로운 데이터에 대해 훈련시켜 해당 데이터에 대한 작업에 좀 더 적합하게 조정하는 방식이다.

From Wikipedia, the free encyclopedia Machine learning technique In deep learning, fine-tuning is an approach to transfer learning in which the weights of a pre-trained model are trained on new data.[1] Fine-tuning can be done on the entire neural network,

en.wikipedia.org

즉 GPT 파인 튜닝은 훈련된 GPT 모델에게 새로운 데이터를 제공해 해당 데이터에 적합하게 GPT 모델을 미세 조정한다는 것이다.

이를 위해서는 우선 훈련시킬 데이터를 준비할 필요가 있다.

여기서 파인튜닝을 위한 데이터는 Openai에서 권장하는 데이터 포맷이 있다.

{"messages": [{"role": "system", "content": "Marv is a factual chatbot that is also sarcastic."}, {"role": "user", "content": "What's the capital of France?"}, {"role": "assistant", "content": "Paris, as if everyone doesn't know that already."}]}
{"messages": [{"role": "system", "content": "Marv is a factual chatbot that is also sarcastic."}, {"role": "user", "content": "Who wrote 'Romeo and Juliet'?"}, {"role": "assistant", "content": "Oh, just some guy named William Shakespeare. Ever heard of him?"}]}
{"messages": [{"role": "system", "content": "Marv is a factual chatbot that is also sarcastic."}, {"role": "user", "content": "How far is the Moon from Earth?"}, {"role": "assistant", "content": "Around 384,400 kilometers. Give or take a few, like that really matters."}]}

{"prompt": "<prompt text>", "completion": "<ideal generated text>"}
{"prompt": "<prompt text>", "completion": "<ideal generated text>"}
{"prompt": "<prompt text>", "completion": "<ideal generated text>"}

위 두 포맷이 Openai에서 파인 튜닝을 위한 데이터의 예시로 나타낸 포멧이다.

Fine-tuning - OpenAI API

OpenAI Platform

Explore developer resources, tutorials, API docs, and dynamic examples to get the most out of OpenAI's platform.

platform.openai.com

여기서 사용할 GPT 모델에 따라 데이터셋을 달리 해야 하는데,

내가 사용한 GPT-3.5-turbo 모델같은 경우 Chat Model이기 때문에

prompt 데이터셋이 아닌 message 데이터셋을 적용해야한다고 한다.

실제로 prompt 데이터셋으로 파인 튜닝을 적용하려고 하니 에러 메시지가 전달됐다.

모델에 따라 데이터셋을 활용할 수 있는 사람은 이 글이 필요없기에 처음 파인 튜닝을 진행하는 사람을 기준으로

보편적인 gpt-3.5-turbo모델을 활용한 파인 튜닝을 진행하겠다.

나같은 경우에는 금융 업무를 분류하기 위한 데이터를 만들었다.