Burned some tokens for a codebase audit ranking

This experiment is nothing scientific; that would have needed a lot more work. I picked a vibe-coded app that was never reviewed and did some funny quota burning and local runs (everything 120B and down was local on RTX 3090 + RTX A4000 + 96 GB RAM). Opus 4.6 in Antigravity was the judge. Hot take: without

✦ Editorial Summary

The author ran an informal experiment ranking codebase audits of a vibe-coded app that had never been reviewed, using a mix of quota burning and local runs. The test was not scientifically rigorous; everything at 120B and below ran locally on an RTX 3090, an RTX A4000, and 96 GB of RAM. Opus 4.6 in Antigravity served as the judge.

r/LocalLLaMA·reddit.com·Mar 15, 2026·1 min read·1 pts

