RE: LeoThread 2024-10-13 12:37

in LeoFinance, 3 months ago

2. Data Scarcity and Access Issues

  • Increasing costs: Companies like Shutterstock are charging AI companies tens of millions of dollars to access their archives.
  • Data restrictions: Many websites are now blocking AI web scrapers (e.g., over 35% of the top 1,000 websites block OpenAI's scraper).
  • Quality data scarcity: Around 25% of data from "high-quality" sources has been restricted from major AI training datasets.
  • Future projections: Some researchers (e.g., Epoch AI) predict that developers may run out of accessible training data between 2026 and 2032 if current trends continue.