SFT # Instruction - [x] https://huggingface.co/datasets/nvidia/HelpSteer - [x] https://huggingface.co/datasets/HuggingFaceH4/no_robots - [x] https://huggingface.co/datasets/migtissera/Synthia-v1.3 - [ ] https://huggingface.co/datasets/teknium/OpenHermes-2.5 - [ ] https://huggingface.co/datasets/Open-Orca/SlimOrca-Dedup - [ ] https://huggingface.co/datasets/garage-bAInd/Open-Platypus - [ ] https://huggingface.co/datasets/kaist-ai/CoT-Collection - [x] https://huggingface.co/datasets/OpenAssistant/oasst_top1_2023-08-25 - [ ] https://huggingface.co/datasets/jondurbin/airoboros-3.1 - [ ] https://huggingface.co/datasets/upaya07/NeurIPS-LLM-data - [ ] https://huggingface.co/datasets/teknium/GPT4-LLM-Cleaned - [ ] https://huggingface.co/datasets/teknium/trismegistus-project - [ ] https://huggingface.co/datasets/stanfordnlp/SHP - [x] https://huggingface.co/datasets/berkeley-nest/Nectar - [x] https://huggingface.co/datasets/HuggingFaceH4/no_robots ## Rollplay - [ ] https://huggingface.co/datasets/PygmalionAI/PIPPA - [ ] https://huggingface.co/datasets/lemonilia/LimaRP?not-for-all-audiences=true # Medical - [ ] https://huggingface.co/datasets/keivalya/MedQuad-MedicalQnADataset - [ ] https://huggingface.co/datasets/AdaptLLM/medicine-tasks - [ ] https://huggingface.co/datasets/zhengyun21/PMC-Patients - [ ] # Chat - [ ] https://huggingface.co/datasets/HuggingFaceH4/ultrachat_200k - [ ] https://huggingface.co/datasets/ehartford/ultrachat-uncensored - [ ] https://huggingface.co/datasets/ehartford/dolphin ## Code - [ ] https://huggingface.co/datasets/theblackcat102/evol-codealpaca-v1 - [ ] https://huggingface.co/datasets/ehartford/dolphin-coder - [ ] https://huggingface.co/datasets/ise-uiuc/Magicoder-OSS-Instruct-75K - [ ] https://huggingface.co/datasets/ise-uiuc/Magicoder-Evol-Instruct-110K ### SQL - [ ] https://huggingface.co/datasets/spider ## Math - [ ] https://huggingface.co/datasets/meta-math/MetaMathQA - [ ] https://huggingface.co/datasets/TIGER-Lab/MathInstruct ## Functional calling - [ ] https://huggingface.co/datasets/glaiveai/glaive-function-calling-v2?row=0 ## DPO - [ ] https://huggingface.co/datasets/berkeley-nest/Nectar - [ ] https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences - [ ] https://huggingface.co/datasets/Anthropic/hh-rlhf - [ ] https://huggingface.co/datasets/unalignment/toxic-dpo-v0.1 - [ ] https://huggingface.co/datasets/Intel/orca_dpo_pairs - [ ] https://huggingface.co/datasets/allenai/ultrafeedback_binarized_cleaned - [ ] https://huggingface.co/datasets/jondurbin/truthy-dpo - [ ] https://huggingface.co/datasets/athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW?not-for-all-audiences=true ## MULTITURN - [ ] https://huggingface.co/datasets/OpenAssistant/oasst1 - [ ] params: https://huggingface.co/alignment-handbook/zephyr-7b-sft-full https://github.com/huggingface/alignment-handbook https://lightning.ai/pages/community/lora-insights/ https://colab.research.google.com/drive/15iFBr1xWgztXvhrj5I9fBv20c7CFOPBE?usp=sharing#scrollTo=MCD77GZ60DOT # RAG 1. https://huggingface.co/datasets/SciPhi/AgentSearch-V1 Data 1: no_robots + capybara + oasst + helpsteer