Thank you so much for open-sourcing such outstanding work! I'm running into difficulties with data construction while trying to reproduce the SFT (Supervised Fine-Tuning) training process. Would it be possible for you to share the SFT training dataset used in this work? It would help me greatly in understanding the training details and advancing my research. If convenient, I'd be happy to receive the data through Hugging Face or email, and I will strictly comply with the data usage agreement. Once again, thank you for your open-source spirit and generous sharing!