ScienceAI 1/10

Burned some token for a codebase audit ranking

This experiment is nothing scientific, would have needed a lot more work. Picked a vibe coded app that was never reviewed and did some funny quota burning and local runs (everything 120B and down was local on RTX3090+RTXA4000+96RAM). Opus 4.6 in antigravity was the judge. Hot take: without
✦ Editorial Summary

The author conducted an informal experiment to audit a codebase, using a specific app andОт tools to evaluate its performance. The test was not scientifically rigorous and relied on local runs on high-end hardware. The results were evaluated using Opus 4.6 in antigravity mode.

r/LocalLLaMA·reddit.com·Mar 15, 2026·1 min read·1 pts
Read original at reddit.comMore Science
WOKHEI The excerpt above is sourced from the original publication. WokHei does not add editorial bias. Click the link below to read the full article at the source.
Discussion
Join the discussion
Sign in for a verified badge and your comments appear instantly. Or post anonymously — anonymous comments are held briefly for moderation.
WokHei Digest
Workflow insights, delivered
Curated tips, tools, and tutorials for builders — twice a month, no spam.
Customize topics first