Ethereum co-founder Vitalik Buterin has shared a progress update on local AI, highlighting that a 2-bit quantized version of DeepSeek V4 can now run within approximately 90 GB of VRAM, reaching around 35 tokens per second on Apple hardware and roughly 7 tokens per second on AMD hardware. The figures suggest that capable local inference is no longer a data-centre-only proposition.
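For a rough sense of what those numbers imply, the back-of-the-envelope sketch below relates bits per weight to memory use and decode speed to response time. It is illustrative only: the helper names are hypothetical, 1 GB is taken as 10^9 bytes, and the estimate ignores KV cache, activations and quantization metadata, all of which consume part of a real memory budget. The update does not state the model's parameter count, so the sketch derives only an upper bound from the 90 GB figure.

```python
def weight_footprint_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes).

    Assumption: weights only; KV cache, activations and quantization
    scales are ignored, so real usage is higher.
    """
    return num_params * bits_per_weight / 8 / 1e9


def max_params_for_budget(budget_gb: float, bits_per_weight: float) -> float:
    """Upper bound on parameters that fit in a memory budget (weights only)."""
    return budget_gb * 1e9 * 8 / bits_per_weight


def seconds_for_reply(num_tokens: int, tokens_per_second: float) -> float:
    """Time to generate a reply at a steady decode rate."""
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    # Upper bound implied by a 90 GB budget at 2 bits per weight.
    bound = max_params_for_budget(90, 2)
    print(f"2-bit upper bound in 90 GB: {bound / 1e9:.0f}B parameters")

    # The same hypothetical parameter count stored at 16 bits per weight.
    print(f"Same count at 16-bit: {weight_footprint_gb(bound, 16):.0f} GB")

    # A 1,000-token reply at the two reported decode speeds.
    print(f"1,000 tokens at 35 tok/s: {seconds_for_reply(1000, 35):.1f} s")
    print(f"1,000 tokens at 7 tok/s: {seconds_for_reply(1000, 7):.1f} s")
```

Under these assumptions, 90 GB at 2 bits per weight caps out at about 360 billion parameters, and the same weights would need about 720 GB at 16 bits. A 1,000-token reply takes roughly half a minute at 35 tokens per second and well over two minutes at 7.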
Buterin framed the broader concept under the label "CROPS AI," arguing it should be defined by multi-platform hardware support rather than the narrower "decentralized AI" framing that has dominated the conversation. He described a "CROPS Ethereum access layer" that overlaps directly with this vision, encompassing ZK-based paid remote LLM calls and private Ethereum RPC reads: privacy-preserving primitives that let AI agents interact with Ethereum without exposing user data on-chain.
WuBlockchain