nrw.social is one of many independent Mastodon servers you can use to take part in the Fediverse.
We are a friendly Mastodon instance from North Rhine-Westphalia. Whether you live in NRW or are simply an NRW fan, everyone is welcome here.

Server statistics: 2.8K active profiles

#AITraining

5 posts · 5 participants · 0 posts today

🚀 Upcoming demo for Data Science & Generative AI starting soon: 19/04/2025 at 8 am IST. Enroll now to master AI and data skills.
✍️Join link: meet.goto.com/142223645
Trainer name: Mr. Vivek (16+ years of industry experience)
📅Demo on: 19th April 2025 @ 8:00 AM (IST).
☎️Contact us: +91 7032290546
📲WhatsApp: wa.me/c/917032290546
🌐Visit: visualpath.in/online-data-scie

!!!!! F*ck off Meta !!!!! Meta announced today that it will soon begin training its AI models on content from adult European users of its social media platforms Facebook and Instagram. The content used for AI training includes adult users' posts and comments, as well as questions and requests from their interactions with the Meta AI assistant.

#meta #facebook #instagram

🚀 Master Data Science with Generative AI Only at #Visualpath!
🎯 Hyderabad’s #DataScience Institute is now offering an advanced, industry-ready Data Science with Generative AI Course.
✅ Live Projects with Real-time Scenarios
✅ 15+ Years Industry Expert Trainers
📞 Call Now: +91-7032290546
📲WhatsApp: wa.me/c/917032290546
🌐Visit us: visualpath.in/online-data-scie

"Finally, AI can fact-check itself. One large language model-based chatbot can now trace its outputs to the exact original data sources that informed them.

Developed by the Allen Institute for Artificial Intelligence (Ai2), OLMoTrace, a new feature in the Ai2 Playground, pinpoints data sources behind text responses from any model in the OLMo (Open Language Model) project.

OLMoTrace identifies the exact pre-training document behind a response — including full, direct quote matches. It also provides source links. To do so, the underlying technology uses a process called “exact-match search” or “string matching.”

“We introduced OLMoTrace to help people understand why LLMs say the things they do from the lens of their training data,” Jiacheng Liu, a University of Washington Ph.D. candidate and Ai2 researcher, told The New Stack.

“By showing that a lot of things generated by LLMs are traceable back to their training data, we are opening up the black boxes of how LLMs work, increasing transparency and our trust in them,” he added.

To date, no other chatbot on the market provides the ability to trace a model’s response back to specific sources used within its training data. This makes the news a big stride for AI visibility and transparency."

thenewstack.io/llms-can-now-tr

The New Stack · Breakthrough: LLM Traces Outputs to Specific Training Data. Ai2’s OLMoTrace uses string matching to reveal the exact sources behind chatbot responses
#AI #GenerativeAI #LLMs
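
The "exact-match search" mentioned in the quoted article can be illustrated with a toy example. The sketch below is not Ai2's implementation: the function name `exact_matches`, the `Match` type, the `min_words` threshold, and the tiny in-memory corpus are all assumptions made for illustration. It only shows the basic idea of finding the longest word spans of a response that appear verbatim in some document. A system working over a real pre-training corpus would need a proper index rather than a linear scan.

```python
from typing import NamedTuple


class Match(NamedTuple):
    span: str    # verbatim text shared by the response and a document
    doc_id: str  # hypothetical identifier of the matching document


def exact_matches(response: str, corpus: dict[str, str], min_words: int = 4) -> list[Match]:
    """Return maximal word spans of `response` that occur verbatim in `corpus`.

    Matching is plain substring search, so a span may also match inside a
    longer word at its edges; a real system would match on token boundaries.
    """
    words = response.split()
    matches: list[Match] = []
    start = 0
    while start < len(words):
        best_end, best_doc = start, None
        for doc_id, text in corpus.items():
            end = start + min_words
            # Grow the span while it is still a substring of this document.
            while end <= len(words) and " ".join(words[start:end]) in text:
                if end > best_end:
                    best_end, best_doc = end, doc_id
                end += 1
        if best_doc is not None:
            matches.append(Match(" ".join(words[start:best_end]), best_doc))
            start = best_end  # continue after the matched span
        else:
            start += 1
    return matches


if __name__ == "__main__":
    # Hypothetical two-document corpus, for illustration only.
    corpus = {
        "doc1": "the quick brown fox jumps over the lazy dog",
        "doc2": "to be or not to be that is the question",
    }
    response = "He said the quick brown fox jumps and then asked to be or not to be"
    for match in exact_matches(response, corpus):
        print(f"{match.doc_id}: {match.span}")
```

The `min_words` threshold keeps very short, common phrases from being reported as matches; the value 4 is arbitrary.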

AFP: Authors hold London protest against Meta for ‘stealing’ work to train AI. “Around 100 authors on Thursday protested outside the London headquarters of Meta, accusing the U.S. tech giant of ‘stealing’ content to train its Artificial Intelligence models.”

https://rbfirehose.com/2025/04/04/afp-authors-hold-london-protest-against-meta-for-stealing-work-to-train-ai/

#activism #ai #aitraining

The Conversation: Africa’s data workers are being exploited by foreign tech firms – 4 ways to protect them. “Since 2015, we have been studying the central role of African data workers in building and maintaining artificial intelligence (AI) systems, acting as ‘data janitors’. Our research found that companies rarely acknowledge the use of human workers in AI value chains, thus they […]

https://rbfirehose.com/2025/04/01/the-conversation-africas-data-workers-are-being-exploited-by-foreign-tech-firms-4-ways-to-protect-them/

#africa #ai #aitraining

Ars Technica: Open Source devs say AI crawlers dominate traffic, forcing blocks on entire countries. “Software developer Xe Iaso reached a breaking point earlier this year when aggressive AI crawler traffic from Amazon overwhelmed their Git repository service, repeatedly causing instability and downtime. Despite configuring standard defensive measures—adjusting robots.txt, blocking known […]

https://rbfirehose.com/2025/03/26/ars-technica-open-source-devs-say-ai-crawlers-dominate-traffic-forcing-blocks-on-entire-countries/

#ai #aitraining #aiassisted

TorrentFreak: Meta’s BitTorrent Uploads of ‘Pirate Library’ Data Equaled 30% of Downloads, Expert Says. “A lawsuit filed by several authors against Meta centers on Meta’s alleged use of pirated books for AI training data and the technical details of BitTorrent which was used to obtain them. Yesterday, Meta filed a motion for summary judgment, while countering the authors’ request to […]

https://rbfirehose.com/2025/03/26/torrentfreak-metas-bittorrent-uploads-of-pirate-library-data-equaled-30-of-downloads-expert-says/

#ai #aitraining #bookpiracy

The Society of Authors: The LibGen data set – what authors can do. “The Atlantic published a searchable database of over 7.5 million books and 81 million research papers. This data set, called Library Genesis or ‘LibGen’ for short, is full of pirated material, and all of it has been used to develop AI systems by tech giant Meta.” FIVE of my books are in this data set. Do you think I […]

https://rbfirehose.com/2025/03/23/the-society-of-authors-the-libgen-data-set-what-authors-can-do/

#ai #aitraining #books

MIT Press: A note on LibGen and the unauthorized use of our authors’ work. “We want to be clear: The MIT Press has not licensed any of our books or journal articles for LLM training purposes, nor have we granted permission for any such use. However, we are well aware that many MIT Press publications have ended up in pirated training data sets. We share the deep distress of our authors whose […]

https://rbfirehose.com/2025/03/22/mit-press-a-note-on-libgen-and-the-unauthorized-use-of-our-authors-work/

#ai #aitraining #books

TechCrunch: Bluesky users debate plans around user data and AI training. “Social network Bluesky recently published a proposal on GitHub outlining new options it could give users to indicate whether they want their posts and data to be scraped for things like generative AI training and public archiving.”

https://rbfirehose.com/2025/03/17/techcrunch-bluesky-users-debate-plans-around-user-data-and-ai-training/

#ai #aitraining #bluesky