WordPress and Tumblr in Talks to Sell User Content to AI Companies Like OpenAI and MidJourney

New Delhi | Updated: 28-02-2024, 19:04 PM IST

Key points

  1. Automattic, the parent company of WordPress and Tumblr, is considering selling user data from its platforms to AI firms like OpenAI and MidJourney for training purposes, sparking concerns about privacy and consent.
  2. Internal dissent within Automattic has emerged, with employees raising reservations about the inclusion of private content inadvertently scraped for AI training, contrary to the company’s intended practices.
  3. Automattic’s plans to introduce a new feature allowing users to opt out of their data being used for AI training, emphasizing transparency and user control over their content.

 

Companies like OpenAI and MidJourney rely on vast amounts of data to enhance the capabilities of their AI tools. However, the sourcing of this data has sparked debates about privacy and consent, particularly when it involves user-generated content from online platforms.

According to a recent report by Gizmodo, Automattic, the company behind popular blogging platforms WordPress and Tumblr, is exploring the possibility of selling content from its sites to AI firms like MidJourney and OpenAI. While the specifics of the arrangement are still emerging, concerns have been raised about the inclusion of private content inadvertently scraped for AI training, contrary to Automattic’s intended practices.

Internal dissent within Automattic has surfaced, with employees expressing reservations about the handling of user data for AI training purposes. Reports suggest that advertising content not owned by Automattic, including materials from previous campaigns, has also found its way into the training dataset, further complicating the situation.

In response to these concerns, Automattic has reiterated its commitment to user privacy and control. The company has pledged to introduce a new feature that will allow users to opt-out of their data being used for AI training at any time. This move aims to provide users with greater transparency and autonomy over their content.

In a blog post addressing the issue, Automattic outlined its plans to empower users with more control over their content. The company highlighted its efforts to discourage crawling by AI companies, including major tech giants, by default. Additionally, Automattic stated that it would only share public content hosted on WordPress.com and Tumblr from sites that haven’t opted out.

Furthermore, Automattic emphasized its collaboration with AI entities, ensuring alignment with community priorities such as attribution, opt-outs, and user control. The company assured users that all partnerships would adhere to opt-out preferences, with ongoing efforts planned to facilitate the removal of content from past sources and future training as requested by users.

As the debate surrounding data privacy and AI intensifies, Automattic’s proactive measures aim to strike a balance between technological innovation and user protection. The company’s commitment to transparency and user control reflects its dedication to maintaining trust and integrity within its online communities.

 

Also Read : AICRA PARTNERS WITH AGNEL INSTITUTE OF TECHNOLOGY, GOA TO ESTABLISH CENTER OF EXCELLENCE

Also Read : AICRA ANNOUNCES INDIA FIRST STARTUP EXPO CONCLAVE 2024 FROM JUNE 28TH TO 30TH.