link
2026-06-23
·
↪ stephen.bochinski.dev
the hardware you buy today may look like a bad bet in a year. for most people, renting open source models from a provider at API rates is the right call. a blend of frontier subscr…
practical breakdown of the three approaches to local AI development without handwaving. the author's actual take matters: self-hosting only wins if you can keep hardware busy (most can't), API rental avoids the sunk cost…
link
2026-06-23
·
↪ github.com
Rio-3.5-Open-397B's weights are a direct element-wise merge of Nex with Qwen3.5-397B-A17B at roughly 0.6/0.4, with no evidence of independent training. Every weight tensor across a…
straightforward model forensics. the authors demonstrate weight interpolation through statistical analysis and behavioral testing—when you strip the system prompt, the model identifies itself as Nex. worth reading as a c…
link
2026-06-23
·
↪ vickiboykis.com
with the most recent releases from Google in the Gemma 4 family, I've finally been able to do agentic coding locally and have loops work at about ~75% the accuracy/speed of frontie…
pragmatic take on local model viability. vickiboykis documents the shift from toy to usable—not hype, just "do i have to double-check against an API?" as the metric. the piece avoids breathless claims but makes a real ob…
link
2026-06-23
·
↪ 9to5google.com
Chrome 150 removes the kExtensionManifestV2Disabled flag, closing the final workaround for Manifest V2 ad blockers. Chrome 151 will remove remaining legacy flags entirely.
the mv3 transition is functionally complete. google's been signaling this for years, but the removal of the escape hatch matters: ublock origin, ad nauseum, and similar tools that relied on mv2's webRequest API are now g…
link
2026-06-23
·
↪ notnotp.com
CTOs are adopting Kubernetes primarily for organizational benefits—uniformity, hireable knowledge, auditability—not because their technical problems require it. Most companies shou…
good pushback on k8s cargo-culting. the author's observation that managed k8s maturity + talent pool flip caused adoption, not technical necessity, rings true. the tradeoff analysis is honest: yes, you get gitops and com…
link
2026-06-23
·
↪ www.theartofdoingstuff.com
The thing has somehow become both more sophisticated and less useful at the same time. That's not easy to accomplish without a lot of planning amongst a lot of people.
worthwhile rant on the death of utilitarian software through featuritis. google home's degradation from simple q&a to verbose, hedged gemini is a concrete example of optimization theater—committees adding guardrails and …
link
2026-06-23
·
↪ paulbuchheit.blogspot.com
If your product is great, it doesn't need to be good. Pick three key attributes or features, get those things very, very right, and then forget about everything else.
paul buchheit's 2010 take on product design holds up because it cuts through the feature-creep trap that kills most projects. the core insight: shipping with 80% effort on three core things beats shipping with everything…
link
2026-06-23
·
↪ tales.fromprod.com
Google Workspace appears to be starting to warn users from Firefox that they must use Chrome. Google support called and claim this will only happen for admins trying to access admi…
worth tracking as a case study in vendor lock-in via UX friction. google's response is classic: support deflects, documentation stays silent, and the threat persists in production. the author—who's actually the admin—get…
link
2026-06-23
·
↪ ianbarber.blog
The complexity came from the tension between the need to continually increase capabilities and the need to stay efficient, particularly for inference. If you want to swap attention…
barber traces how llms went from clean transformer stacks to baroque collections of specialized attention variants, moe routing, and cross-gpu communication ops. the parallel to recsys is instructive: once performance be…
link
2026-06-08
·
↪ www.0xsid.com
a zero auth password reset in production with no check on whether the provided email was actually used before. the support AI just changes the linked email if you ask nicely enough…
instagram's account recovery flow is a masterclass in what not to do. attacker spoofs location via vpn, contacts support ai claiming compromise, redirects verification codes to attacker-controlled email, done. 2fa gets n…
link
2026-06-08
·
↪ www.techradar.com
Citizens will be on their best behavior, because we're constantly recording and reporting everything that is going on. AI will automatically report issues while analyzing footage c…
ellison articulating the panopticon-as-a-service architecture we're building. worth reading not for the orwell comparison (predictable) but as a concrete description of the stack: ubiquitous sensors + real-time video pro…
link
2026-06-08
·
↪ blog.alexellis.io
k3s is a stripped-down Kubernetes distribution that comes in at around 40mb and bundles all low-level components required such as containerd, runc and kubectl. Unlike kubeadm which…
practical walkthrough of running k3s on raspberry pi clusters. ellis covers the real constraints—kubeadm's assumptions don't hold on arm hardware, wifi latency kills kubernetes, fake sd cards ruin your day. the tradeoff …
link
2026-06-08
·
↪ github.com
GPU-accelerated UI framework for Zig, targeting macOS (Metal), Linux (Vulkan/Wayland), and Browser (WASM/WebGPU). Zero external Zig package dependencies — builds against system fra…
gooey is a pragmatic take on cross-platform GPU UI: hybrid immediate/retained mode, declarative components with flexbox layout, pure state separation. the zero-dependency approach is notable—no zig package ecosystem lock…
link
2026-06-08
·
↪ www.404media.co
While Google CEO Sundar Pichai proudly tells the world that 75 percent of all new code at the company is AI-generated, internally Google employees are sharing memes about how AI is…
the gap between executive messaging and ground truth. when a company's own engineers are meme-ing about tool limitations, there's a signal worth hearing. this isn't about whether ai code generation is useful—it clearly i…
link
2026-06-08
·
↪ www.jeffgeerling.com
High end server hardware has this feature built-in (HP's ILO, Dell's iDRAC, or IPMI), but not everyone has access to server motherboards. For the rest of us, there's now an explosi…
comprehensive hands-on comparison of IP KVMs from PiKVM down to budget alternatives. geerling tests the practical cases where remote desktop/VNC fall short—benchmarking without resource overhead, accessing locked/powered…
link
2026-06-08
·
↪ www.0xkato.xyz
Tokens aren't usually whole words. They're usually subword pieces... Subword tokenization sits in the middle. The most common pieces become single tokens, and rare or novel words g…
solid pedagogical walk through transformer internals. covers the stack systematically—tokenization through generation—without hand-waving the math away entirely. the strawberry-R's example is good; tokenization choices h…
link
2026-06-08
·
↪ www.cnbc.com
SpaceX will provide Google with ~110k Nvidia GPUs across data centers through June 2029 at $920M/month, with penalties if capacity isn't delivered by Sept 2026. The deal comes as S…
interesting window into the actual economics of compute infrastructure arbitrage. SpaceX built massive capex (mostly in Memphis) betting on xAI's Grok, which hasn't materialized—so now they're renting excess capacity to …
link
2026-06-01
·
↪ pganalyze.com
In Postgres 17 we do not yet have asynchronous I/O, but we do see a performance benefit from what is essentially a refactoring work around the internal APIs. The streaming and vect…
solid groundwork for eventual async I/O in postgres. the key insight here is that refactoring internal read APIs to handle multiple buffers at once yields immediate wins—10% faster ANALYZE, better sequential scan through…
link
2026-06-01
·
↪ xata.io
Separate per-tenant data tables from the control plane tables. Place the per-tenant data tables in the region closest to where you expect the users to be. Create a global view of t…
practical walkthrough of geo-distributing postgres for multi-tenant systems without requiring specialized db features. the pattern is straightforward: segment by tenant, keep control plane centralized, use fdw for cross-…
link
2026-06-01
·
↪ www.wsj.com
Apple's privacy changes (ATT framework) created asymmetric outcomes: hurt ad-targeting for Snap and Meta while Google's first-party data advantages insulated them from the impact.
paywall'd but the premise is solid—worth digging into the mechanism. Apple's iOS privacy changes exposed the fragility of third-party cookie/IDFA reliance. Google weathered it better due to search monopoly and existing f…