"A few months ago running something this capable locally meant serious hardware and serious tradeoffs on quality..."
Anthropic turns off the compute taps for third-party tools and rushes to get more TPUs online.
TurboQuant reduces the working memory needed for vector quantisation without sacrificing accuracy, offering a material reduction in inference costs.
Hackers "enumerated and accessed objects within S3 buckets, terminated production EC2 and RDS instances, and decrypted application keys."
Google is offering $30 million to help Europeans master AI to offset future AI-driven job losses, while calling for more permissive AI regulation. At the Future of Work Forum in Latvia, Google execs announced a new project and corresponding funding to help Europeans meet the AI era. Google's
The API keys Google told you to make public can now be used to exfiltrate data via Gemini or run up usage, says Truffle.