I wanted the TiinyAI Pocket Lab to be real.
I just finished a five-month painful journey with NVIDIA’s DGX Spark, a $3,999 “personal AI supercomputer” that turned out to be bandwidth-kneecapped to protect NVIDIA’s datacenter business. I wrote about it here. Long story short, NVIDIA hid the one number that mattered, 273 GB/s memory bandwidth, behind impressive-sounding specs like “128GB unified memory!” and “1 PFLOP FP4!” I trusted them, I bought it, I got played and I sold it at a loss.
That was a tough experience that definitely taught me not to trust marketing blindly. After being lied to once, I learnt to look for what’s not on the spec sheet. The numbers they lead with are the ones that might not matter that much, kinda like when a magician is doing misdirection. The numbers they don’t tell you are the ones that matter the most.
So after selling the DGX at a loss I was back on the horse searching for a solution that would allow me to run inference locally. Just like everyone else I wanted the biggest memory pool (ideally) for the lowest cash. That is when I found a startup called TiinyAI claiming they’d built a pocket-sized device that runs a 120-billion-parameter model at 20 tokens per second for $1,299. My first thought? Too good to be true. I opened a notepad, then I opened their website and looked at their marketing photos, their spec sheet and got to work. How does that go again? Fool me once, shame on you. Fool me twice shame on me? I was not going to be fooled twice. The first thing I thought was: 120B parameters on LPDDR5X? C’mon now. The DGX Spark has 128GB of unified LPDDR5X at 273 GB/s and I just spent five months learning why that wasn’t enough to run 70B models at interactive speeds. You’re claiming 120B on the same memory technology. What is this Voodoo magic you have…
The next few days became a scavenger hunt trying to piece together how does this device work, what is all this tech they claim to be so ground breaking and how can they defy the physics that NVIDIA were not able to fool.
What I found was three layers of technical misdirection stacked on top of forked academic research, wrapped in a corporate structure designed for opacity, and sold through a Kickstarter campaign with a $10,000 funding goal that has now raised $1.7 million from 1,266 backers.
Let’s get started.
Part 1: The Promise
TiinyAI’s pitch is simple, the Pocket Lab is “the first pocket-size AI supercomputer.” It plugs into your laptop via USB-C, has 80GB of LPDDR5X RAM, an ARM SoC with a 30 TOPS co-located NPU, a 160 TOPS dedicated external NPU (190 TOPS total), and can run large language models with up to “120 billion” parameters locally. No cloud, no GPU, no subscription fees. Zero token costs, always on, private and offline.
The Kickstarter launched March 11, 2026. It hit $1 million from 728 backers in its first five hours. As of today, it sits at $1,737,722 from 1,266 backers, 17,377% of its $10,000 goal. The super early-bird price is $1,399. If you placed a $9.90 deposit on their website beforehand, you get it for $1,299. Estimated delivery: August 2026.
... continue reading