So let me ask: now that AI generates mass data for self-training, will people be stealing this data 🤔 and who are they going to sell it to?

Which data do you think is more valuable: quality human data or quality AI-regenerated data?

What data are you referring to? The data on Hive is public; anyone can use it simply by setting up an API.

Human data is the most beneficial, but humans interacting with synthetic data is also helpful. The long-term value of synthetic data is hotly debated. Nobody knows whether it degrades as it is fed repeatedly back into model training.

That is why I tell people to interact with what is posted, even if it is synthetic. The responses help to generate context.