• About
  • Advertise
  • Contact
Monday, June 2, 2025
  • Login
No Result
View All Result
NEWSLETTER
The NY Journals
  • Home
  • Business
  • Technology
  • Entertainment
  • Sports
  • Lifestyle
  • Health
  • Politics
  • Trending
  • Home
  • Business
  • Technology
  • Entertainment
  • Sports
  • Lifestyle
  • Health
  • Politics
  • Trending
No Result
View All Result
The NY Journals
No Result
View All Result
Home Technology

Google can train search AI with web content even after opt-out

by Sarkiya Ranen
in Technology
Google can train search AI with web content even after opt-out
0
SHARES
0
VIEWS
Share on FacebookShare on Twitter


[NEW YORK] Google can train its search-specific artificial intelligence (AI) products, such as AI Overviews, on content across the web even when the publishers have chosen to opt out of training Google’s AI products, a vice-president of product at the company testified in court on Friday. 

That’s because Google’s controls for publishers to opt out of AI training only cover work by Google DeepMind, the company’s AI lab, and not any other organisation at the company, said Eli Collins, a Google DeepMind vice-president. 

“Once you take the Gemini” AI model “and put it inside the search org, the search org has the ability to train on the data that publishers had opted out of training, correct?” asked Diana Aguilar, a Department of Justice lawyer.

“Correct – for use in search,” Collins responded. 

Google summarises answers to search queries using its AI at the top of results, which may result in users not clicking on independent websites for answers – a trend that’s hurting their revenue, website publishers said. Google is using data from those same sites to generate the information powering AI answers.

Google called Collins to the witness stand as part of a three-week trial in federal court in Washington, held to determine how Google should restore competition to online search. Last year, US District Judge Amit Mehta ruled that the tech giant illegally monopolised the search market and is now weighing a set of changes proposed by antitrust enforcers to address its control.

BT in your inbox

Start and end each day with the latest news stories and analyses delivered straight to your inbox.

The Justice Department is urging the court to force Google to sell its widely used Chrome browser and to share key data it uses to generate search results. The agency is also asking Judge Mehta to bar Google from paying to be the default search engine on other apps and devices – a restriction that would extend to its AI offerings, including Gemini, which the government argues have benefited from the company’s unlawful dominance in search.

Aguilar, the DOJ lawyer, asked Collins whether he knew how much more additional data Google’s search organisation had access to beyond the content that Google DeepMind had trained its AI models on. When Collins answered that he did not know, Aguilar produced a document from August 26, 2024 titled Search GenAI <> Gemini v3.

According to that document, Google removed 80 billion of 160 billion “tokens” – snippets of content – after filtering out the material that publishers had opted out of allowing Google to use for training its AI. The document also listed search “sessions data”, or data collected during a period of time in which a user interacted with Google Search, as well as YouTube videos, as data that could augment Google’s AI models.

After viewing the document, Mehta asked Collins for clarification. “The 80 billion out of 160 billion tokens, 50 per cent is removed by publishers opting out?”

“That is correct,” Collins responded.

Later, Google’s lawyer sought to show that the tech company’s dominance of search did not prevent other AI companies from competing fiercely to provide accurate, real-time results within their chatbot services. If a user asked an AI chatbot for sports scores, for example, Collins testified that the chatbot would likely return the correct answer because the company that made the bot had a commercial arrangement with a sports score provider – it wouldn’t need to rely on a web index. 

But Google has explored how its AI models could be much improved by the data it has already gathered through years of operating the world’s most popular search engine, testimony also showed. At another point during the cross-examination of Collins, the DOJ lawyer Aguilar showed the Google vice-president a briefing document meant for Demis Hassabis, chief executive of Google DeepMind.

In a comment, Hassabis had mused about training an unidentified Google AI model with a wealth of search data – including search rankings – to see how much more the AI model was improved by the data, compared to one that wasn’t trained with it.

“Did Google end up building a model using search data?” Aguilar asked Collins.

“Not that I’m aware,” he responded.

“But at least Mr Hassabis has thought it would be interesting to look at?” she pressed.

“Yes,” Collins said. BLOOMBERG



Source link

Tags: ContentGoogleOptOutSearchTrainWeb
Sarkiya Ranen

Sarkiya Ranen

I am an editor for Ny Journals, focusing on business and entrepreneurship. I love uncovering emerging trends and crafting stories that inspire and inform readers about innovative ventures and industry insights.

Next Post
“Will Strike Structures On Indus River”: Pak Minister’s Warning To India

"Will Strike Structures On Indus River": Pak Minister's Warning To India

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recommended

Violence During Final Phase Voting In Bengal, EVM Tossed Into Pond

Violence During Final Phase Voting In Bengal, EVM Tossed Into Pond

1 year ago
Zayn Malik Pays Tribute to One Direction 10 Years After His Exit – E! Online

Zayn Malik Pays Tribute to One Direction 10 Years After His Exit – E! Online

2 months ago

Popular News

    Connect with us

    The NY Journals pride themselves on assembling a proficient and dedicated team comprising seasoned journalists and editors. This collective commitment drives us to provide our esteemed readership with nothing short of the most comprehensive, accurate, and captivating news coverage available.

    Transcending the bounds of New York City to encompass a broader scope, we ensure that our audience remains well-informed and engaged with the latest developments, both locally and beyond.

    NEWS

    • Business
    • Technology
    • Entertainment
    • Sports
    • Lifestyle
    • Health
    • Politics
    • Real Estate
    Instagram Youtube

    © 2025 The New York Journals. All Rights Reserved.

    • About Us
    • Advertise
    • Contact Us
    No Result
    View All Result
    • Home
    • Business
    • Technology
    • Entertainment
    • Sports
    • Lifestyle
    • Health
    • Politics
    • Trending

    Copyright © 2023 The Nyjournals

    Welcome Back!

    Login to your account below

    Forgotten Password?

    Retrieve your password

    Please enter your username or email address to reset your password.

    Log In