{"id":1377,"date":"2024-11-22T07:04:39","date_gmt":"2024-11-22T07:04:39","guid":{"rendered":"https:\/\/blog.oqtacore.com\/?p=1377"},"modified":"2024-12-24T06:12:39","modified_gmt":"2024-12-24T06:12:39","slug":"googles-big-leak-what-happened","status":"publish","type":"post","link":"https:\/\/oqtacore.com\/blog\/googles-big-leak-what-happened\/","title":{"rendered":"Google&#8217;s Big Leak | What happened?"},"content":{"rendered":"<p>Google hit by major internal documentation leak: Insights, and Secrets Exposed<\/p>\n<p><!--more--><\/p>\n<p><span style=\"font-weight: 400;\">On <strong>May 28, 2024,<\/strong> Google faced a massive <\/span><a href=\"https:\/\/hexdocs.pm\/google_api_content_warehouse\/0.4.0\/api-reference.html\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">leak of internal documentation<\/span><\/a><span style=\"font-weight: 400;\">.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The inner workings of Google&#8217;s search engine are among the most secretive and closely guarded black boxes in the world. And&#8230; they leaked online due to a developer&#8217;s mistake, who confused private and public repositories on GitHub.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For the past 10 years, there have been no reports of leaks of this magnitude and detail from Google&#8217;s search division.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">C-levels from Sparktoro, iPullRank, and other top SEO firms have showcased leaked documents about how Google\u2019s search ranking works. Well, in reality it\u2019s more complex than that. They leaked various APIs around the search engine, but even from these APIs, much can be inferred. A total of 2500 pages of internal Google Search documentation was made public.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">And naturally, the documents contradict what Google has been saying about site promotion in its search engine for years.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This article will be useful for everyone who owns a domain or is involved in website promotion; some very interesting secrets were revealed. Let&#8217;s dive in!<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-1379\" src=\"https:\/\/oqtacore.com\/blog\/wp-content\/uploads\/2024\/11\/11111-1.jpg\" alt=\"\" width=\"1024\" height=\"574\" srcset=\"https:\/\/oqtacore.com\/blog\/wp-content\/uploads\/2024\/11\/11111-1.jpg 1024w, https:\/\/oqtacore.com\/blog\/wp-content\/uploads\/2024\/11\/11111-1-300x168.jpg 300w, https:\/\/oqtacore.com\/blog\/wp-content\/uploads\/2024\/11\/11111-1-768x431.jpg 768w, https:\/\/oqtacore.com\/blog\/wp-content\/uploads\/2024\/11\/11111-1-180x101.jpg 180w, https:\/\/oqtacore.com\/blog\/wp-content\/uploads\/2024\/11\/11111-1-800x448.jpg 800w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<h2><span class=\"ez-toc-section\" id=\"Whats_the_Deal_and_Can_You_Trust_the_Leak\"><\/span><b>What&#8217;s the Deal and Can You Trust the Leak?<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Numerous checks through various former and current Googlers indicate that this is not a fake, not a joke, but a very real leak, the investigation of which is now of great concern to all SEO researchers.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Such documentation exists in many Google teams, explaining APIs to help familiarize project members with the available data. This leak coincides in appearance with documentation in public GitHub repositories and Google Cloud API documentation, using the same notation style, formatting, and even the names of processes\/modules\/functions and links.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In short, they leaked instructions for members of the Google search system team.<\/span><\/p>\n<p><em><span style=\"font-weight: 400;\">Apparently, the leak happened from GitHub. Someone accidentally and briefly published documentation in public access, apparently confusing a private Google repository with a public one. While the folder was there, somewhere between March and May 2024, the API documentation got onto Hexdocs and from there, it was downloaded by anyone interested.<\/span><\/em><\/p>\n<p><span style=\"font-weight: 400;\">Those who are into website optimization and promotion have long known that Google persistently denies using user data for ranking and search quality.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Back in 2012, there were rumors that clicks on links in Google Chrome were more important than any others &#8211; because Google could measure such clicks. There was much talk about the search engine using user behavior data collected through Google Chrome and extensions for ranking and promoting sites, and at that time Google officially stated that it did not use them. But despite this, SEO specialists did not believe these statements. As it turned out, they were right.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">How can you be sure of the authenticity of the leak? After all, Google could have abandoned some functions, used others exclusively for testing or internal projects, or even made API functions available that were never used.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Obviously, the search engine changes significantly from year to year, and recent features like AI were not highlighted in this leak. However, the documentation contains references to outdated functions and specific notes to others, indicating that they should no longer be used. This suggests that the functions not marked as outdated were still actively used at the time of the leak in March 2024.<\/span><\/p>\n<h2><span class=\"ez-toc-section\" id=\"Where_Google_Lied\"><\/span><b>Where Google Lied<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><b>&#8220;We don&#8217;t rank sites by authority&#8221;<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Google claimed they don&#8217;t use &#8220;domain authority.&#8221; What does this mean? The &#8220;Domain Authority&#8221; metric was created by Moz, and it assesses how authoritative a site is overall, primarily based on citations. The metric is based on various factors such as the number and quality of links leading to the site. Moz developed this metric as a way to measure the likelihood of a site ranking well in search results.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Firstly, when Google says they don&#8217;t use &#8220;domain authority,&#8221; they don&#8217;t mean it doesn&#8217;t exist at all. They&#8217;re just saying they don&#8217;t use that specific metric from Moz. And it doesn&#8217;t mean that Google doesn\u2019t have a way to assess the authority of sites. They might just not use the same methodology and algorithms as Moz.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Secondly, Google might not assess the authority of a site for a specific topic. That is, they might not measure how important a site is in a particular field of knowledge or for a specific subject.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">However, this doesn&#8217;t mean that Google doesn&#8217;t assess the quality or importance of sites at all. In fact, they have &#8220;siteAuthority,&#8221; which helps determine how reliable and authoritative a site is. Their metric is based on many factors, like content quality, site structure, and user time on site.<\/span><\/p>\n<p><em><span style=\"font-weight: 400;\">Overall, Google is just playing with words when talking about ways to assess the authority of sites.<\/span><\/em><\/p>\n<p><b>&#8220;We don&#8217;t use clicks for ranking&#8221;<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Another lie from Google &#8211; their claims that they didn\u2019t track clicks for ranking sites.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Recently, the Vice President of Search testified in the AntiTrust trial and revealed interesting things. Specifically, he talked about the ranking systems Glue and NavBoost.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">NavBoost is based on clicks. This system analyzes how often users click on certain search results. If a link gets clicked frequently, it might indicate that it&#8217;s more relevant and useful to users. Based on the collected data, NavBoost can boost or lower the positions of results in the search output.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Glue, on the other hand, is not related to links but to content. It includes news, images, videos, and everything else. Glue can also analyze behavior but focuses on interaction with content.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Plus, everyone knows that clicks are the best thing to track site promotion. But even then, it&#8217;s not entirely clear how to promote?<\/span><\/p>\n<p><span style=\"font-weight: 400;\">It&#8217;s all due to Google&#8217;s evasive answers and, let&#8217;s be honest, a lot of complimentary articles about the search engine that just repeat official statements.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Furthermore, the leaked documentation represents users as &#8220;votes.&#8221; Each user click is considered a vote for the relevance of a page. The more clicks, the higher the likelihood that the page is useful. Naturally, all clicks are distributed by geolocation, devices, and other parameters.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Moreover, the search engine through Navboost and Glue tracks the last time a user successfully found the needed information on a page by clicking on it. Therefore, if a page doesn&#8217;t get clicks for a long time, it means it&#8217;s outdated and can be downgraded in the search output.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Additionally, not only the number of clicks is considered, but also the time spent on the page after clicking the link. The longer the user stays on the page, the higher the chance they found something useful.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">But if almost immediately after clicking the link the user closes the page or clicks &#8220;back,&#8221; it means the page was not useful.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">All this is taken into account during ranking.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In the leaked documentation, NavBoost is mentioned 84 times. There is also evidence that they consider its evaluation at the subdomain, root domain, and URL level, which implies different treatment of different levels of the site.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">If Forbes.com\/Cats\/ doesn\u2019t have clicks, it is marked as &#8220;low quality,&#8221; and the link is ignored for analytics. But if Forbes.com\/Dogs\/ has a large volume of clicks, it is marked as &#8220;high quality.&#8221;<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In short, based on the number of clicks the page gets, sites are divided into three categories, for each of which its own &#8220;quality rank&#8221; is built, and more popular sites by clicks bring a greater contribution to PageRank, i.e., are more valuable.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">So yes, Google doesn&#8217;t directly talk about click tracking and data collection from Google Chrome, but there are proofs of the opposite. Google uses clicks and post-click behavior in its ranking algorithms.<\/span><\/p>\n<p><b>&#8220;We don&#8217;t boost sites&#8221;<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Another thing that came out is that there are lists of sites in Google search that are forcibly optimized. Well, traffic is forcibly driven to them, they are at the top of the search. Known only in some topics, such as elections in the states in 2020 or COVID, but there are clearly other topics.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Covid was a very controversial topic, and we still do not know exactly if vaccination did more good or bad.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">But what do people do when they need more info? They&#8230; Google it! And if Google shows all sites, where there are opinions of both sides, that is, both &#8220;right&#8221; and &#8220;wrong,&#8221; it will provoke tension. So they go the easy wat and limit one of the sides in the search results.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Whether it&#8217;s right or not is more of an ethical question.<\/span><\/p>\n<h3><b>\u201cThere is no Sandbox\u201d<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">For years, Google has been saying that the sandbox, into which sites fall by age or lack of signs of trust, does not exist. In case you don\u2019t know, the sandbox is a filter that discards young, newly created sites. Such a filter allows discarding &#8220;junk&#8221; sites that are created purely for promotion or as intermediaries. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">But, good sites also fall into this filter. Conditionally, someone made a one-page site unrelated to the main site for a sales funnel hoping to increase coverage. And here, bam, it doesn\u2019t even show up in the search. How can this be, and what to do, Google said they don\u2019t have a garbage dump! But it turns out they do.<\/span><\/p>\n<h3><b>&#8220;We don&#8217;t do manual work&#8221;<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">And the most interesting: data from EWOK is used directly in the search. This is a system where live people sit and evaluate, for money, which search result is better.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Apparently, there are users who, with their own eyes and opinions, determine which of several sites is better for a particular query.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Therefore, it is important not to underestimate how important it is for quality assessors to perceive and evaluate your sites well. Just like impressing a concierge. Either you go on your business, or you stand in the entrance waiting for someone from the 12th floor to come down to you.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">And another revelation from the leak: Google considers the brand size of a site, not only based on the site itself but in general on the mention of this site on the internet. Even without links. In principle, this was obvious.<\/span><\/p>\n<h2><span class=\"ez-toc-section\" id=\"How_to_Attract_Organic_Traffic_Now\"><\/span><b>How to Attract Organic Traffic Now<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><span style=\"font-weight: 400;\">So, what to do now?<\/span><\/p>\n<p><span style=\"font-weight: 400;\">If someone asks you: how to attract traffic to your site? Answer boldly, you won&#8217;t go wrong: you need to create a noticeable, popular, well-recognizable brand outside of Google.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The EEAT algorithm is actually not as important as SEO specialists think. As it turned out, the only mention where this EEAT is needed is in reviews on Google Maps. Is it relevant? Find at least ten friends who use Google Maps to find a website.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The real aspects of E-E-A-T, not Google&#8217;s claims, are either hidden, indirect, or so deeply buried that they have little to do with specific elements of ranking and promotion systems.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When you have a brand, Google starts to perceive you as a separate &#8220;entity,&#8221; not just content or a collection of links in one place. And then all the SEO optimization benefits open up to you. In case you didn\u2019t know, Google defines an entity as &#8220;a thing or concept that is singular, unique, well-defined, and distinguishable.&#8221;<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Building influence as a content author can indeed lead to a higher ranking and rise higher in search. And a little browsing through the search engine shows that there are many powerful brands that rank very well, but they don&#8217;t engage in these rat races of boosting EEAT metrics. Content and links actually take a backseat when there is a clear user intent to find something. Suppose, many people in the center of NY search for &#8220;Joe&#8217;s Burgers&#8221; and scroll through one, two, three pages of results until they find the caf\u00e9 with Joe&#8217;s Burgers, and then go to this site. Pretty quickly, the search engine will understand that this is exactly what people want with this query in this area.<\/span><\/p>\n<p><em><span style=\"font-weight: 400;\">Even if the first search result is a Wikipedia article about the first every Joe who created his first ever burger, it may generate a lot of clicks and views, but it is unlikely to surpass the signals of user intent wanting a burger in the center of Moscow.<\/span><\/em><\/p>\n<p><span style=\"font-weight: 400;\">If you extend this example to the broader network and search in general, then if you can create demand for your site among enough probable searchers in the regions you are targeting, you can bypass the need for classic SEO methods like links, anchor texts, optimized content, and so on.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Navboost and user intent in a specific area are likely now the most powerful ranking factors in the Google system.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Classic ranking factors, however, have not gone anywhere. Though PageRank, anchors, and text matching have been losing their significance for many years, page titles are still very important. The document leak hints that many versions of PageRank have been created and canceled over the years.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In general, anyone who wants to engage in SEO optimization will likely face either low profits, weak traffic, or maybe even work at a loss. You need to build authority in the search engine, citation, navigational demand, and strong reputation among the audience. In short, drive leads to your site by all means, right and wrong.<\/span><\/p>\n<h2><span class=\"ez-toc-section\" id=\"Conclusion\"><\/span><b>Conclusion<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Well, to dispel any doubts, Google confirmed the leak. Undoubtedly, this is the most significant leak about Google search in recent years. It turns out that Google often lied in its recommendations, statements regarding the search engine. It\u2019s time to stop believing that content is king, clickbaits, and bot farms are the way to success in SEO.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">What to do now? We&#8217;ve already figured it out: build your brand, gain authority, be a quotable source, and a search target in specific areas. What&#8217;s your SEO strategy? Will you change it now?\u00a0<\/span><\/p>\n<p><i><span style=\"font-weight: 400;\">We hope this article was helpful to you.\u00a0<\/span><\/i><\/p>\n<p><strong>READ MORE:<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/oqtacore.com\/blog\/breaking-boundaries-with-figure-01-the-future-of-automation\/\">Figure 01: The Future of Automation<\/a><\/li>\n<li><a href=\"https:\/\/oqtacore.com\/blog\/cloud-costs-optimization-main-changes\/\">Cloud Cost Optimization Over 10 Years: What\u2019s New<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Google hit by major internal documentation leak: Insights, and Secrets Exposed<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_mo_disable_npp":"","yasr_overall_rating":0,"yasr_post_is_review":"","yasr_auto_insert_disabled":"","yasr_review_type":"","footnotes":""},"categories":[1],"tags":[34],"class_list":["post-1377","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-google"],"acf":{"image":1378},"yasr_visitor_votes":{"number_of_votes":0,"sum_votes":0,"stars_attributes":{"read_only":false,"span_bottom":false}},"_links":{"self":[{"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/posts\/1377","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/comments?post=1377"}],"version-history":[{"count":10,"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/posts\/1377\/revisions"}],"predecessor-version":[{"id":1472,"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/posts\/1377\/revisions\/1472"}],"wp:attachment":[{"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/media?parent=1377"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/categories?post=1377"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/oqtacore.com\/blog\/wp-json\/wp\/v2\/tags?post=1377"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}