Washington (AP): Artificial intelligence researchers have said they deleted more than 2,000 web links to suspected child sexual abuse imagery from a dataset used to train popular AI image-generator tools.
The LAION research dataset is a huge index of online images and captions that's been a source for leading AI image-makers such as Stable Diffusion and Midjourney.
But a report last year by the Stanford Internet Observatory found it contained links to sexually explicit images of children, contributing to the ease with which some AI tools have been able to produce photorealistic deepfakes that depict children.
That December report led LAION, which stands for the nonprofit Large-scale Artificial Intelligence Open Network, to immediately remove its dataset. Eight months later, LAION said in a blog post on Friday that it worked with the Stanford University watchdog group and anti-abuse organisations in Canada and the United Kingdom to fix the problem and release a cleaned-up dataset for future AI research.
Stanford researcher David Thiel, author of the December report, commended LAION for significant improvements but said the next step is to withdraw from distribution the “tainted models” that are still able to produce child abuse imagery.
One of the LAION-based tools that Stanford identified as the “most popular model for generating explicit imagery” - an older and lightly filtered version of Stable Diffusion - remained easily accessible until Thursday, when the New York-based company Runway ML removed it from the AI model repository Hugging Face. Runway said in a statement Friday it was a “planned deprecation of research models and code that have not been actively maintained”.
The cleaned-up version of the LAION dataset comes as governments around the world are taking a closer look at how some tech tools are being used to make or distribute illegal images of children.
San Francisco's city attorney earlier this month filed a lawsuit seeking to shut down a group of websites that enable the creation of AI-generated nudes of women and girls. The alleged distribution of child sexual abuse images on the messaging app Telegram is part of what led French authorities to bring charges on Wednesday against the platform's founder and CEO, Pavel Durov.
Durov's arrest “signals a really big change in the whole tech industry that the founders of these platforms can be held personally responsible,” said David Evan Harris, a researcher at the University of California, Berkeley, who recently reached out to Runway to ask why the problematic AI image-generator was still publicly accessible. It was taken down days later.
Mumbai (PTI): Royal Challengers Bengaluru skipper Rajat Patidar, Phil Salt and Virat Kohli blasted half-centuries as the defending champions beat Mumbai Indians by 18 runs in an Indian Premier League match here on Sunday.
Salt (78 off 36 balls) and Kohli (50 off 38 balls) stitched together a 120-run stand for the opening wicket before Patidar scored a rapid 53 off just 20 balls as RCB posted 240 for 4.
In response, Mumbai Indians were restricted to 222 for 5, with RCB spinner Suyash Sharma (2/47) putting the skids on the home side with a double strike in the eighth over, from which they could not recover.
Sherfane Rutherford top-scored for MI with an unbeaten 71 off 31 balls.
Opener Rohit Sharma appeared to be struggling with a hamstring issue and had to retire hurt on 19. His partner Ryan Rickelton made 37, while Suryakumar Yadav (33) and Hardik Pandya (40) were the other contributors for MI.
Brief scores:
Royal Challengers Bengaluru: 240 for 4 in 20 overs (Phil Salt 78, Virat Kohli 50, Rajat Patidar 53, Tim David 35 not out).
Mumbai Indians: 222 for 5 in 20 overs (Sherfane Rutherford 71 not out, Ryan Rickelton 37, Hardik Pandya 40; Suyash Sharma 2/47).
