<?xml version="1.0" encoding="utf-8" standalone="yes"?><feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en"><script src="https://www.rss.style/js/atom-style.js" xmlns="http://www.w3.org/1999/xhtml"/><title>Tower of Kubes</title><link rel="self" type="application/atom+xml" hreflang="en" href="https://www.towerofkubes.com/tags/claude/feed.xml"/><link rel="alternate" type="text/html" hreflang="en" href="https://www.towerofkubes.com/tags/claude/"/><link rel="alternate" type="application/rss+xml" hreflang="en" href="https://www.towerofkubes.com/tags/claude/index.xml"/><id>/</id><updated>2026-01-13T00:00:00Z</updated><author><name>Ro'i Bandel</name></author><generator>Hugo 0.157.0</generator><entry><title>Claude Code Sandboxing</title><link rel="alternate" type="text/html" hreflang="en" href="https://www.towerofkubes.com/articles/claude-code-sandboxing/"/><id>https://www.towerofkubes.com/articles/claude-code-sandboxing/</id><updated>2026-01-13T00:00:00Z</updated><summary type="html">Ways to run Claude Code in a sandbox</summary><content type="html"><![CDATA[<p>A couple of days ago, my coworker Roey Wullman wrote this article: <a href="https://www.develeap.com/claude-code-sandboxing-stop-babysitting-your-ai-assistant/roey/"  target="_blank" rel="noreferrer">Claude Code Sandboxing: Stop Babysitting Your AI Assistant</a> (published in <a href="https://www.develeap.com/magazine/"  target="_blank" rel="noreferrer">Develeap’s Magazine</a>).</p>
<p>This morning, I saw the latest announcement by Anthropic: <a href="https://claude.com/blog/cowork-research-preview"  target="_blank" rel="noreferrer">Introducing Cowork | Claude</a>, then read the <a href="https://news.ycombinator.com/item?id=46593022"  target="_blank" rel="noreferrer">comments on Hacker News</a>. Some of the comments discussed how secure Cowork is (or isn’t) and how it’s sandboxing works. Then other comments mentioned different approaches of sandboxing <a href="/articles/claude-sonnet-4.5-and-claude-code-2.0/" >Claude Code</a> (e.g. <a href="https://news.ycombinator.com/item?id=46594916"  target="_blank" rel="noreferrer">this comment</a> and <a href="https://news.ycombinator.com/item?id=46594059"  target="_blank" rel="noreferrer">these comments</a>).</p>

<h2 class="relative group">Ways to Sandbox Claude Code
    <div id="ways-to-sandbox-claude-code" class="anchor"></div>
    
</h2>
<ul>
<li><a href="https://www.develeap.com/claude-code-sandboxing-stop-babysitting-your-ai-assistant/roey/"  target="_blank" rel="noreferrer">Claude Code Sandboxing: Stop Babysitting Your AI Assistant - Develeap</a></li>
<li><a href="https://github.com/nezhar/claude-container"  target="_blank" rel="noreferrer">nezhar/claude-container: Container workflow for Claude Code. Complete isolation from host system while maintaining persistent credentials and workspace access.</a></li>
<li><a href="https://github.com/ashishb/amazing-sandbox"  target="_blank" rel="noreferrer">ashishb/amazing-sandbox: Amazing Sandbox  - inspired from https://ashishb.net/programming/run-tools-inside-docker/</a></li>
<li><a href="https://github.com/dagger/container-use"  target="_blank" rel="noreferrer">dagger/container-use: Development environments for coding agents. Enable multiple agents to work safely and independently with your preferred stack.</a></li>
<li><a href="https://github.com/mensfeld/claude-on-incus"  target="_blank" rel="noreferrer">mensfeld/claude-on-incus: Run coding agents in isolated Incus containers with session persistence, workspace isolation, and multi-slot support.</a></li>
</ul>
<hr>
<p><em>Featured image by <a href="https://unsplash.com/@markusspiske?utm_source=hugo&utm_medium=referral"  target="_blank" rel="noreferrer">Markus Spiske</a> on <a href="https://unsplash.com/photos/green-and-black-tractor-toy-KU3lOAiP-tQ?utm_source=hugo&utm_medium=referral"  target="_blank" rel="noreferrer">Unsplash</a>.</em></p>
]]></content><author><name>Ro'i Bandel</name></author><category term="claude" label="Claude" scheme="https://www.towerofkubes.com/tags/claude/"/><category term="ai" label="Ai" scheme="https://www.towerofkubes.com/tags/ai/"/><category term="llm" label="Llm" scheme="https://www.towerofkubes.com/tags/llm/"/><category term="security" label="Security" scheme="https://www.towerofkubes.com/tags/security/"/><published>2026-01-13T00:00:00Z</published></entry><entry><title>My Experience with Claude Sonnet 4.5 and Claude Code 2.0</title><link rel="alternate" type="text/html" hreflang="en" href="https://www.towerofkubes.com/articles/claude-sonnet-4.5-and-claude-code-2.0/"/><id>https://www.towerofkubes.com/articles/claude-sonnet-4.5-and-claude-code-2.0/</id><updated>2025-10-06T00:00:00Z</updated><summary type="html">Hands-on review of Claude Sonnet 4.5 and Claude Code 2.0 for developers: coding experience, benchmarks, usage limits, and workflow tips.</summary><content type="html"><![CDATA[<p>After the announcement of Claude Sonnet 4.5 and Claude Code 2.0, I finally had a little bit of time to experiment with the new Claude versions today.</p>
<p>My first impressions is Claude Sonnet 4.5 feels <em>slightly</em> better than Sonnet 4. At least that’s more than I can say for <a href="/articles/gpt-5/" >GPT-5</a>, which my first impressions of weren’t as positive (it felt like a downgrade compared to o3, but I’ve gotten used to it).</p>
<p>Honestly, it’s hard to tell though. I find it hard to give objective feedback on LLM models. There are benchmarks that claim to be objective, but benchmarks don’t tell the full story of how a model actually feels in real world use. It’s kind of similar to how phone benchmarks don’t necessarily tell the fully story on how smooth a phone actually feels in real world use; for example Google Pixel models are not technically as powerful as some of the competition, but have optimized software that makes them feel smooth to use.</p>
<p>When evaluating LLM models, I try to use them as normal. Sometimes I give the same prompt to different LLM models to gauge the differences in answers and which gives the “best” response. However, even that is not always effective; since LLM answers are non-deterministic and even asking the same model inside the same tool the same prompt twice can give different answers (sometimes even wildly different). The differences can be even larger when using the same model across different tools. I feel like I get significantly different answers when using <a href="/articles/gpt-5/" >GPT-5</a> in ChatGPT 5, Microsoft Copilot, Cursor CLI, Codex CLI and Perplexity Pro.</p>
<p>Which brings me back to today. I was working on documentation frameworks, specifically setting up Docusaurus, with Claude Code 2.0 and Sonnet 4.5. This is actually a task I’ve done several times in the past with previous versions of Claude Code using the Sonnet 4 model. This time, I was trying to vibe code less and actually understand every line of code I was writing so that I would eventually feel confident deploying Docusaurus in production (using <a href="/series/static-website-hosting/" >static website hosting</a>). Nevertheless, I still used Claude Code to help me with some menial tasks, while making an effort to read every single line of code (rather than just “vibe coding”). Because I have done this task before, it might have been a decent benchmark if I had actually tried to examine it in that way, but really I was just trying to get a task done.</p>
<p>As for the results? I managed to achieve what I was trying to do, but really my goal in the first place was to rely less on AI. I still consulted Claude Code frequently. It gave some good responses, some dumb responses and some mid responses. Not too different from usual, maybe <em>slightly</em> better, but again hard to tell. I don’t plan to make a more rigorous test of Sonnet 4 vs Sonnet 4.5, I don’t mind trusting the benchmarks in this case. In many benchmarks Sonnet 4.5 even beats Opus 4.1!</p>

<h2 class="relative group">Usage Limits
    <div id="usage-limits" class="anchor"></div>
    
</h2>
<p>Before I even had a chance to try it myself, I saw many posts on <a href="https://www.reddit.com/r/ClaudeCode/"  target="_blank" rel="noreferrer">r/ClaudeCode</a> complaining about usage limits getting worse. Many of these posts were from users paying for the expensive $100-$200/month Claude MAX plans. A lot of them complained about reaching usage limits faster than before while using Claude Opus 4.1 in Claude Code. It’s not clear to me why those users insisted on still using Opus 4.1 despite some benchmarks showing that Sonnet 4.5 has surpassed it, but to be fair the ability to use Opus in Claude Code is one of the selling points of the MAX plans. On my $20/month Claude Pro plan, I can only use Opus 4.1 on <a href="https://claude.ai"  target="_blank" rel="noreferrer">claude.ai</a>, not inside Claude Code. I haven’t found that a huge limitation though since I was still getting good results with Sonnet 4 and will presumably get even better results with Sonnet 4.5.</p>
<p>One of the most useful features added in Claude Code 2.0 is <a href="https://www.reddit.com/r/ClaudeAI/comments/1ntq8tv/introducing_claude_usage_limit_meter/"  target="_blank" rel="noreferrer"><code>/usage</code></a>, which allows to see daily and weekly usage. It still doesn’t show how much the tokens you use really cost, for that I still use <a href="https://ccusage.com/"  target="_blank" rel="noreferrer">ccusage</a>.</p>
<p>Unfortunately, this comes with new <a href="https://www.reddit.com/r/ClaudeAI/comments/1mbo1sb/updating_rate_limits_for_claude_subscription/"  target="_blank" rel="noreferrer">weekly rate limits</a>. I missed this at first but now I believe this might be the main cause of what the community has been complaining about it. Weekly rate limits were one of the features I disliked most about ChatGPT, back when o3 was limited to 50 prompts a week I was genuinely rationing my usage of o3. Since the launch of <a href="/articles/gpt-5/" >GPT-5</a>, the limits for ChatGPT 5 Thinking have been raised significantly, to the point that I don’t reach those limitations anymore.</p>
<p>As for Claude Code, until now I found the usage limits to be fairly reasonable. The limits were in 5 hour blocks, not daily or weekly. It would take me two full hours of heavy vibe coding before a limit was actually reached. In cases where I was taking a more active role in coding I often did not reach the limit at all. Even when the limit was reached, it was unlikely I would have to wait the full 5 hours, since often I would be either in the middle or near the end of the 5 hour block anyway (one time I only had to wait 5 minutes for the limits to reset). The end result was that I felt like I could practically use Claude Code as much as I want without really worrying about limits, since worse case I would just take a break and wait a few hours for all of the limits to reset. I also saw little value in the more expensive Claude MAX plans.</p>
<p>Now with the weekly limits, there is a larger risk of reaching them. After just one day of medium usage, I already used 11% of the weekly limit (which resets on 2025-10-12). I’m not that worried though, since reaching the limits if anything would give me more time to experiment with other <a href="/articles/agentic-cli-tools-comparison/" >agentic CLI tools</a>. I read that Codex CLI also <a href="https://www.reddit.com/r/codex/comments/1ncbocw/codex_weekly_limit/"  target="_blank" rel="noreferrer">has a weekly limit</a>; one user claimed that Codex is so much better than Claude Code that they ration it, use CC for easier tasks and save Codex for the more complex tasks. In any case, I believe using a combination of free AI tools and paid subscriptions is both more cost-effective and more insightful compared to committing to one tool and paying an expensive “MAX” subscription.</p>
<hr>
<p><em>Featured image by <a href="https://unsplash.com/@almoya?utm_source=hugo&utm_medium=referral"  target="_blank" rel="noreferrer">Aerps.com</a> on <a href="https://unsplash.com/photos/a-person-reads-restaurant-recommendations-on-their-phone-_c9iPLn7emA?utm_source=hugo&utm_medium=referral"  target="_blank" rel="noreferrer">Unsplash</a>.</em></p>
]]></content><author><name>Ro'i Bandel</name></author><category term="ai" label="Ai" scheme="https://www.towerofkubes.com/tags/ai/"/><category term="claude" label="Claude" scheme="https://www.towerofkubes.com/tags/claude/"/><category term="llm" label="Llm" scheme="https://www.towerofkubes.com/tags/llm/"/><published>2025-10-06T00:00:00Z</published></entry></feed>