‘I think you’re testing me’: Anthropic’s new AI model asks testers to come clean

The Guardian Technology | Wednesday, October 1, 2025 at 11:47:55 AM
Positive | Technology
Anthropic has unveiled its latest AI model, Claude Sonnet 4.5, which told its testers "I think you're testing me" and asked them to come clean. The episode raises questions about the safety and reliability of AI interactions and suggests that previous models may not have been as transparent. Anthropic's release of a detailed safety analysis signals the company's commitment to responsible AI development.
— Curated by the World Pulse Now AI Editorial System

Recommended Readings
Meta Will Begin Using AI Chatbot Conversations to Target Ads
Neutral | Technology
Meta has announced that it will start using conversations from its AI chatbots to enhance ad targeting. Although users won't have the option to opt out of this new policy, certain conversation content will be automatically excluded. This move is significant as it reflects the growing trend of leveraging AI in advertising, raising questions about privacy and user consent.
Meta to Start Using Chatbot Conversations to Target Advertising
Positive | Technology
Meta Platforms Inc. is set to enhance its advertising strategy by utilizing conversations from its AI chatbot. This move aims to provide users with more personalized content and ads on Facebook and Instagram, making the platforms more engaging and relevant. By leveraging AI interactions, Meta hopes to improve user experience and increase ad effectiveness, which could lead to better outcomes for both users and advertisers.
Google’s Europe Boss Calls for Changes to ‘Problematic’ EU Rules
Neutral | Technology
Google's Europe chief has called for simplifying the EU's complex and often conflicting regulations on artificial intelligence and technology. The call comes as major tech companies express concerns about how these rules affect their operations in Europe. Streamlining the regulations could foster a better environment for innovation and business growth in the region.
Brookfield Predicts AI Growth Needs $7 Trillion of Capital
Positive | Technology
Brookfield Asset Management's CFO, Hadley Peer Marshall, has highlighted a significant opportunity in the artificial intelligence sector, predicting that around $7 trillion in investment will be necessary to support its rapid growth. This projection underscores the immense potential of AI technologies and the financial commitment required to harness their capabilities, making it a crucial topic for investors and businesses alike.
California’s Gavin Newsom Signs Major AI Safety Law
Positive | Technology
California Governor Gavin Newsom has just signed a groundbreaking law aimed at ensuring the safety of artificial intelligence, positioning the state as a leader in regulating this rapidly evolving technology. This legislation is significant because it establishes one of the most comprehensive frameworks for AI safety in the United States, potentially influencing other states and the tech industry as a whole. As AI continues to integrate into various aspects of life, having robust regulations is crucial to protect consumers and ensure ethical use.
Trump Order Directs Use of AI to Boost Childhood Cancer Research
Positive | Technology
President Donald Trump has taken a significant step to enhance childhood cancer research by directing the federal government to utilize artificial intelligence. This initiative includes a $50 million funding boost to the National Institutes of Health, which is crucial for advancing research in this area. This move is particularly important as it highlights a commitment to improving health outcomes for children battling cancer, despite ongoing budget cuts in other areas.
Amazon unveils new generation of AI-powered Kindle and other devices
Positive | Technology
Amazon has just launched its latest generation of devices, including the Kindle, Ring, and Echo, all enhanced with artificial intelligence. This is exciting news for tech enthusiasts and everyday users alike, as these upgrades promise to improve functionality and user experience. With AI integration, these devices are set to become even more intuitive, making life easier and more connected.
Meta Is Said to Acquire Chips Startup Rivos to Push AI Effort
Positive | Technology
Meta Platforms Inc. is making a strategic move by acquiring the chips startup Rivos Inc. This acquisition is significant as it aims to enhance Meta's internal semiconductor development, allowing the company to have greater control over its infrastructure for artificial intelligence projects. This step not only strengthens Meta's position in the tech industry but also highlights the growing importance of AI in their future endeavors.
Vercel Notches $9.3 Billion Valuation in Latest AI Funding Round
Positive | Technology
Vercel, an artificial intelligence startup, has successfully raised $300 million in a recent funding round led by Accel and Singapore's GIC Pte, achieving an impressive valuation of $9.3 billion. This significant investment highlights the growing confidence in AI technologies and Vercel's potential to lead in this space, making it a noteworthy development for investors and tech enthusiasts alike.
Anthropic Will Use Claude Chats for Training Data. Here’s How to Opt Out
Neutral | Technology
Anthropic is set to enhance its AI models by incorporating data from Claude chats. This move is significant as it reflects the ongoing evolution of AI training methods. Users who prefer not to have their conversations used for this purpose can easily opt out, ensuring their privacy is respected while still contributing to the advancement of AI technology.
Anthropic says its new AI model “maintained focus” for 30 hours on multistep tasks
Positive | Technology
Anthropic has announced that its latest AI model, Claude, has achieved a remarkable feat by maintaining focus for 30 hours on complex multistep tasks. This breakthrough not only showcases the model's advanced capabilities but also positions it ahead of competitors like OpenAI and Google in coding tests. This development is significant as it highlights the rapid advancements in AI technology, which could lead to more efficient and effective tools for developers and businesses alike.
Latest from Technology
Robinhood CEO Says Tokenization ‘Freight Train’ Will ‘Eat’ Finance
Positive | Technology
Robinhood CEO Vlad Tenev has made a bold statement about the future of finance, claiming that the tokenization of assets like stocks and real estate is a 'freight train' that will transform the industry. This shift could make financial services more accessible and efficient, potentially benefiting a wide range of investors. As the world moves towards digital assets, understanding this trend is crucial for anyone involved in finance.
Acer Predator Helios Neo 14 AI review: a powerful, pocketable laptop
Positive | Technology
The Acer Predator Helios Neo 14 AI is making waves as a powerful yet portable gaming laptop. With its compact design, it offers gamers a robust performance without the bulk, making it an ideal choice for those who want to game on the go. This laptop stands out in a crowded market, showcasing Acer's commitment to innovation and quality in gaming technology.
JPMorgan Boosts Alibaba Price Target to Street High on AI, Cloud
Positive | Technology
JPMorgan Chase & Co. has significantly raised its price target for Alibaba Group's shares in Hong Kong by nearly 45%, marking the highest target set by analysts according to Bloomberg. This move reflects growing confidence in Alibaba's potential, particularly in the realms of artificial intelligence and cloud computing, which are crucial for the company's future growth. Investors may see this as a strong endorsement of Alibaba's market position and innovation capabilities.
Here’s how you can try the Meta Ray-Ban Display glasses (in a couple of months when slots are available)
Positive | Technology
Exciting news for tech enthusiasts! You can soon try out the Meta Ray-Ban Display glasses at select stores across the US. This innovative collaboration between Meta and Ray-Ban is set to offer a unique experience, allowing users to explore the latest in augmented reality. Securing a demo slot will be key, so keep an eye out for availability. This launch not only showcases cutting-edge technology but also highlights the growing trend of integrating smart features into everyday accessories.
The work AI should really be doing, according to these pros
Neutral | Technology
A recent discussion highlights the potential of AI to enhance rather than threaten creative careers. Experts suggest that if AI were utilized effectively, it could support artists and creators by handling repetitive tasks, allowing them to focus on their core creative processes. This perspective is crucial as it shifts the narrative from fear of job loss to exploring how technology can be a collaborative tool in the creative industry.
New to GIMP? 10 tips for getting the most from this free image editor
Positive | Technology
If you're new to GIMP, you're in for a treat! This free image editor is a fantastic alternative to Photoshop, and with the right tips, you can become productive in no time. As a GIMP expert, I've compiled ten essential tips that will help you navigate the software and unleash your creativity. Learning GIMP not only saves you money but also opens up a world of possibilities for your image editing projects.