Data Gen for LLM


Minor Update on Major Developments

Hi everyone!

I’m excited to share a minor update that represents a major improvement for the project: a template system that the program will use going forward. I’ll also be generating a dataset using various models, some run locally and some through external services, and every model used will be one whose license permits commercial use.

Guidelines for Contributed Content

To keep the dataset clean, I’ll be using Llama Guard to vet all generated and submitted content against our standards. Converted works will not be accepted unless they come with appropriate licensing permissions.

If you'd like to contribute, please provide a link to your personal dataset in the correct format. All accepted contributions will be consolidated and made available on Hugging Face as the official dataset for this project. This will allow anyone to apply it to their model of choice, supporting openness and accessibility for all.

How You Can Support the Project

If you want to contribute but don’t have a dataset to share, you can also support the project by purchasing the program, even at its lowest price. That support encourages me to dedicate more of my free time to fun projects like this one.

Acknowledgments

I want to give proper credit where it's due. If you're contributing a significant amount of your own human-created work, please let me know. All contributions will be properly labeled, indicating the type of content, its location in the dataset, and who contributed it.
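
As a rough illustration of what such a label could hold, here is a minimal Python sketch. The final layout has not been decided, so every name in it (ContributionLabel, content_type, first_row, last_row, contributor, to_jsonl) is a hypothetical placeholder and not the project's actual schema.

```python
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class ContributionLabel:
    """Hypothetical attribution record for one contributed block of rows."""
    content_type: str  # kind of content, e.g. "comedy" or "song introduction"
    first_row: int     # index of the first dataset row in the contribution
    last_row: int      # index of the last dataset row (inclusive)
    contributor: str   # name or handle to credit


def to_jsonl(labels):
    """Serialize labels as JSON Lines, one label per line."""
    return "\n".join(json.dumps(asdict(label), sort_keys=True) for label in labels)


if __name__ == "__main__":
    # Placeholder values for illustration only.
    example = [
        ContributionLabel("comedy", 0, 99, "example_contributor"),
        ContributionLabel("song introduction", 100, 149, "another_contributor"),
    ]
    print(to_jsonl(example))
```

Keeping the labels in a separate JSON Lines file would leave the dataset rows themselves untouched while still making it possible to look up who contributed any given row.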

Thanks for all your support! Feel free to reach out in the comments if you're interested in contributing.

Files

Comedy_and_Song-Introduction_Generation_Format.md 6.1 kB
Nov 30, 2024
