What data are you referring to? The data on Hive is public, anyone can use it simply by setting up an API.
Human data is the most beneficial but humans interacting with synthetic data is helpful also. The value of synthetic data, long term, is hotly debated. Nobody knows if it degrades as it is fed through repeatedly into model training.
That is why I tell people to interact with what is posted, even if synthetic. The responses how to generate context.