Skip to content

Fix zmq and ROSS compilation issues#240

Open
sanjaychari wants to merge 9 commits into
codes-org:masterfrom
sanjaychari:kronos-develop-director-b
Open

Fix zmq and ROSS compilation issues#240
sanjaychari wants to merge 9 commits into
codes-org:masterfrom
sanjaychari:kronos-develop-director-b

Conversation

@sanjaychari
Copy link
Copy Markdown

@sanjaychari sanjaychari commented May 21, 2026

The kronos-develop-director-b branch of CODES was using an outdated version of ROSS and also had compilation issues because of zeromq and CUDA. This PR changes it to be compatible with the master branch of ROSS and fixes the zeromq and CUDA compilation issues.

kevinabrown and others added 5 commits June 17, 2024 00:13
The kronos-develop-director-b branch of CODES
was using an outdated version of ROSS and also
had compilation issues because of zeromq. This
commit changes it to be compatible with the master
branch of ROSS and fixes the zeromq compilation
issues.
@sanjaychari sanjaychari changed the title Fix zmq and ROSS compilation issues [WIP] Fix zmq and ROSS compilation issues May 21, 2026
Compilation with torch-jit was not occuring even with torch_enable set to 1.
This commit fixes torch-jit compilation with GPU support.
@sanjaychari
Copy link
Copy Markdown
Author

sanjaychari commented May 21, 2026

@sanjaychari sanjaychari changed the title [WIP] Fix zmq and ROSS compilation issues Fix zmq and ROSS compilation issues May 21, 2026
@sanjaychari
Copy link
Copy Markdown
Author

I ran a sequential simulation with a dummy PyTorch checkpoint file and this code works for sequential simulation. Conservative and optimistic simulation have some issues with GVT consistency but that might be solved by an accurate ML model, or in a separate pull request.

@sanjaychari
Copy link
Copy Markdown
Author

sanjaychari commented May 21, 2026

The GVT consistency issues with optimistic mode were happening because of network_treatment_on_switch being set to "freeze" in the CODES conf file. Events from the ML model were scheduled to arrive before GVT and were sent without any delay when received by the PDES simulation after the switch, and ROSS reported these as stragglers.

Changing network_treatment_on_switch to "nothing" fixes the issue.

@caitlinross
Copy link
Copy Markdown
Member

i'm good with merging this

@sanjaychari sanjaychari changed the base branch from kronos-develop-director-b to master May 26, 2026 17:37
This commit makes the kronos-develop-director-b branch compatible with the master branch
and introduces ML modelling code to be used with the director.
@sanjaychari sanjaychari force-pushed the kronos-develop-director-b branch from 63693c0 to 51f691b Compare May 27, 2026 14:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants