Google has reportedly confirmed a massive data leak of around 2,500 internal documents circulating the web. Some of the leaked data concerns the company's search engine ranking algorithm.
Thousands of papers that appeared to have come from Google's internal Content API Warehouse were reportedly released on Github on March 13 by an automated bot named yoshi-code-bot. These documents were made available to SparkToro co-founder Rand Fishkin early this month.
According to the paper released, Google may gather and use information on clicks, Chrome users, and other factors that company spokespeople have previously stated have no bearing on website rankings in Google Search.
For Google employees, the thousands of pages of documents serve as a knowledge repository, but it is unclear what specific data points are used to rank search results; the information may be outdated, used only for training, or gathered but not specifically for search. Furthermore, the records don't disclose how, if at all, various elements are weighted in search.
Search engine optimization (SEO) specialists Rand Fishkin and Mike King published preliminary evaluations of the documents and their contents earlier this week, revealing the presence of the leaked material.
Fishkin's specifics are technical and detailed; developers and SEO specialists will probably be able to understand them better than the average individual. Furthermore, there is no guarantee that Google bases search rankings on the precise data and signals mentioned in the leak.
Instead, as SEO expert Mike King noted in his summary of the documents, the leak describes the information Google gathers from pages, sites, and searches and provides SEO experts with oblique clues about what Google appears to be interested in.
Read Also: Google Play Contains Over 90 Malware-Ridden Apps Installed For More Than 5.5 Million Times
Google SEO vs. AI Overviews
Google's leaked SEO documents follow just a few weeks after the tech giant's search engine was implemented with AI Overviews, another feature that could change how SEO and content monetization work.
AI Overviews, also known as "Search Generative Experience," is a new Google search feature that summarizes web content using its internal LLM (large language models).
According to reports, "AI Overviews" are programmed to appear precisely when Google's algorithm determines the quickest and most effective time to engage a person. This will likely happen when someone plans, strategizes, or engages in intellectual discourse.
Although this functionality sounds fantastic, Google's AI integration has several quality issues. First, it sometimes shows erroneous data and takes a little while to respond.
Google AI Overviews has received negative early reviews. User opinions on Google forums suggest the function is often misleading and unnecessary.
AI Overviews vs. News Publishers
As Google attempted to satisfy user expectations further, News/Media Alliance CEO Danielle Coffey purportedly told sources that the new feature would harm their business, giving even less incentive to click through so that news organizations could monetize their material.
Coffey claims there will be a considerable reduction in the little traffic they currently receive. Additionally, because a prominent search engine is solidifying its market dominance, Coffey's group includes over 2,000 news publishers, taking a hard stand against AI developers employing journalism.
They assert they must follow Google's guidelines once more because their final product uses their material as fuel and directly rivals news content.