jeosol / llm-post-training Star 1 Code Issues Pull requests LLM Post-Training, RLHF, PPO, DPO, etc post-training ppo dpo post-training-quantization llm rlhf llmalignment post-training-learning llmfinetuning Updated Apr 10, 2026 Jupyter Notebook
trisanu-das / CRISP Star 0 Code Issues Pull requests Critic-free Reward-Integrated Self-distillation Policy Optimization reinforcement-learning self-distillation post-training-learning Updated Jun 7, 2026 Python
theMethodolojeeOrg / Axon Star 0 Code Issues Pull requests Discussions Coordinated AI systems evolving together. macos ios network-topology relational-intelligence adaptive-intelligence predicate-logging egoic-memory allocentric-memory post-training-learning lamarkian lamarckian-evolution Updated Jun 13, 2026 Swift