{"id":13265,"date":"2023-10-27T07:57:59","date_gmt":"2023-10-27T11:57:59","guid":{"rendered":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/?p=13265"},"modified":"2023-10-27T07:57:59","modified_gmt":"2023-10-27T11:57:59","slug":"peak-data","status":"publish","type":"post","link":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/peak-data\/13265\/","title":{"rendered":"Peak Data"},"content":{"rendered":"Originally published in East Wind, Oct 25, 2023. I\u2019m probably not the first person to write about the insane leverage that LLMs confer to engineers, but\u00a0Stack Overflow\u2019s 28% layoff\u00a0really got me thinking about the future of\u00a0human-generated data, especially in the context of a potential\u00a0model collapse\u00a0(whereby \u201cmodels forget the true underlying data distribution\u201d once they are trained on machine-generated data). I explore whether we are at \u201cpeak data\u201d in terms of both the quality and percentage of human-generated data on the internet, how this might affect the efficacy of future AI models, and potential solutions\/product opportunities that exist. For <a href=\"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/peak-data\/13265\/\" class=\"more-link\">(more&hellip;)<\/a>","protected":false},"excerpt":{"rendered":"<p>Originally published in East Wind, Oct 25, 2023. I\u2019m probably not the first person to write about the insane leverage that LLMs confer to engineers, but\u00a0Stack Overflow\u2019s 28% layoff\u00a0really got me thinking about the future of\u00a0human-generated data, especially in the context of a potential\u00a0model collapse\u00a0(whereby \u201cmodels forget the true underlying data distribution\u201d once they are [&hellip;]<\/p>\n","protected":false},"author":72,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":"","_links_to":"","_links_to_target":""},"categories":[11,48],"tags":[879,368,1334,1268,243],"class_list":["post-13265","post","type-post","status-publish","format-standard","hentry","category-industry-news","category-left-hand","tag-ai","tag-artificial-intelligence","tag-data-storage","tag-llm","tag-machine-learning"],"_links":{"self":[{"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/posts\/13265","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/users\/72"}],"replies":[{"embeddable":true,"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/comments?post=13265"}],"version-history":[{"count":1,"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/posts\/13265\/revisions"}],"predecessor-version":[{"id":13266,"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/posts\/13265\/revisions\/13266"}],"wp:attachment":[{"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/media?parent=13265"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/categories?post=13265"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.predictiveanalyticsworld.com\/machinelearningtimes\/wp-json\/wp\/v2\/tags?post=13265"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}