Dotplan
Updated June 12, 2026
June 12, 2026
Finishing up the first build-out of the portfolio website. For now it’ll be simple: just a place where I can write down daily notes and thoughts (dotplan) and do writeups of my projects in /posts.
Current focus is finishing up the Azuki TCG PPO model while I finish moving into my new apartment. Right now I’m testing out Anthropic’s Fable with a couple of goals toward that end.
The two goals running right now:
-
Fable running on a lambda instance to convert the original C engine used for simulation into a JAX-based engine. This is based on a paper I found that explored using models to re-create sims in more performant languages. The models can use the pre-existing, less performant engine as a validation reference while putting together the new engine in the new language. The goal has been running for 24 hours now, and from the metrics so far, I should expect about a 5x performance improvement in the full training flow. Outside of the training flow, the JAX sim seems to run 100x faster than the C sim. That gap makes me think that after this, I’ll probably have to see if there are any improvements I can make in the actual training pipeline itself. But I’ll take it step by step. Those improvements would be nice, but if I can get to the end state of a model that can both build a deck and play the game intelligently without them, I’ll skip them so I can move on to my robotics projects.
-
Another Fable goal, but this one is simply running ablations based on several papers to verify that both deck building and game playing are possible, and to structure the rewards in a way that leads to the outcome I want. The original models did show some strategy, but deck building currently shows that the models really only prioritize aggression instead of utilizing the different playstyles that come out of playing different types of decks (gates, leaders, and overall deck composition).
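The idea in the first goal, using the old engine as validation for the new one, amounts to a differential test: replay the same seeded action sequences through both engines and flag the first step where their states disagree. Below is a minimal sketch of that loop in plain Python. Everything in it is an assumption for illustration: the toy `(player_hp, opponent_hp)` state, the three actions, and the names `reference_step`, `candidate_step`, `first_divergence`, and `validate` are hypothetical stand-ins, not the actual C or JAX engine.

```python
import random
from typing import Callable, Iterable, List, Optional, Tuple

# Hypothetical toy state: (player_hp, opponent_hp). A real engine state is far richer.
State = Tuple[int, int]
StepFn = Callable[[State, int], State]


def reference_step(state: State, action: int) -> State:
    """Stand-in for the original, trusted engine."""
    hp, opp = state
    if action == 0:  # attack: deal 3 damage, floor at 0
        return (hp, max(opp - 3, 0))
    if action == 1:  # heal: restore 2 hp, cap at 20
        return (min(hp + 2, 20), opp)
    return (hp, opp)  # pass


def candidate_step(state: State, action: int) -> State:
    """Stand-in for the ported engine: same rules, written branch-free."""
    hp, opp = state
    damage = 3 if action == 0 else 0
    heal = 2 if action == 1 else 0
    return (min(hp + heal, 20), max(opp - damage, 0))


def first_divergence(ref: StepFn, cand: StepFn, seed: int,
                     n_steps: int = 50, n_actions: int = 3) -> Optional[int]:
    """Replay one seeded random action sequence through both engines.

    Returns the first step index where the states differ, or None if they agree.
    """
    rng = random.Random(seed)
    ref_state: State = (20, 20)
    cand_state: State = (20, 20)
    for t in range(n_steps):
        action = rng.randrange(n_actions)
        ref_state = ref(ref_state, action)
        cand_state = cand(cand_state, action)
        if ref_state != cand_state:
            return t
    return None


def validate(ref: StepFn, cand: StepFn, seeds: Iterable[int]) -> List[Tuple[int, int]]:
    """Collect (seed, step) pairs for every seed where the engines diverge."""
    failures = []
    for seed in seeds:
        t = first_divergence(ref, cand, seed)
        if t is not None:
            failures.append((seed, t))
    return failures


if __name__ == "__main__":
    failures = validate(reference_step, candidate_step, range(100))
    print("diverging seeds:", failures)
```

Returning the seed and step of the first mismatch keeps failures reproducible, which is what makes the old engine useful as a reference while the port is still changing. With floating-point state, exact equality would likely need to become a tolerance check.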