PivotRL
Work in progress on distillation-aware reinforcement learning workflows for compact policy transfer.
Read more →thoughts too long for a tweet
Work in progress on distillation-aware reinforcement learning workflows for compact policy transfer.
Read more →WolfeClick wraps Pokemon Showdown as an OpenEnv-compatible environment so LLMs can learn legal action selection and long-horizon strategy from live battles.
Read more →A practical setup and usage guide for Trip AI, including itinerary generation, customization, and local development tips.
Read more →An experiment-driven explanation of estimating N-Queens search complexity with Monte Carlo sampling and backtracking.
Read more →How EasyQuizzes combines retrieval-augmented generation and vector search to generate study flashcards from notes and PDFs.
Read more →A clean single-pass Sudoku validator using hash sets to track rows, columns, and 3x3 boxes efficiently.
Read more →A hands-on walkthrough of building a Unix-style shell in C using fork, exec, pipes, and process control.
Read more →