Torch-native SDPO benchmark training with optional Modal-backed train steps.
python -m continualcode model_name=Qwen/Qwen3-4B-Instruct-2507
python -m continualcode.benchmarks.lcb_eval split=test max_samples=100
metrics.jsonl
samples.jsonl
checkpoint_dir
continualcode/benchmarks/auto_train.py
continualcode/modal_train.py
continualcode/model_utils.py
continualcode/sdpo_loss.py
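The .jsonl names above suggest JSON Lines files, with one JSON object per line. A minimal sketch for loading such a file follows. It assumes that format and nothing about the fields inside, which this README does not document. The helper name read_jsonl is hypothetical and not part of continualcode.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Hypothetical helper: parse a JSON Lines file into a list of objects.

    Assumes one JSON value per non-empty line; the actual schema of
    metrics.jsonl and samples.jsonl is not specified in this README.
    """
    records = []
    with Path(path).open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:  # skip blank lines rather than failing on them
                records.append(json.loads(line))
    return records
```

Calling read_jsonl on the path where a run wrote metrics.jsonl returns a list with one entry per logged line. Inspect the first record's keys before relying on any particular field.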