左圖:
右圖:
指令微調:
連續思考微調 (Chain-of-thought finetuning):
多任務指令微調 (Multi-task instruction finetuning):
reference:
or
By clicking below, you agree to our terms of service.
New to HackMD? Sign up