Directory | Changes | Lines of Code | Lines per Change |
---|---|---|---|
Totals | 44 (100.0%) | 623 (100.0%) | 14.1 |
CCA/Components/Schedulers/ | 25 (56.8%) | 557 (89.4%) | 22.2 |
Core/Grid/ | 4 (9.1%) | 39 (6.3%) | 9.7 |
CCA/Components/LoadBalancers/ | 1 (2.3%) | 6 (1.0%) | 6.0 |
include/ | 2 (4.5%) | 5 (0.8%) | 2.5 |
CCA/Components/ICE/ | 3 (6.8%) | 5 (0.8%) | 1.6 |
CCA/Components/MPM/ | 4 (9.1%) | 4 (0.6%) | 1.0 |
CCA/Ports/ | 3 (6.8%) | 3 (0.5%) | 1.0 |
CCA/Components/SimulationController/ | 1 (2.3%) | 3 (0.5%) | 3.0 |
CCA/Components/Arches/ | 1 (2.3%) | 1 (0.2%) | 1.0 |
When a reduction variable is computed by multiple tasks,
schedule the reduction task after the very last task that computes this variable.
27 lines of code changed in 5 files:
remove reduction task map
0 lines of code changed in 3 files:
restrict the first timestep to a single thread
We should treat the restart and init timesteps separately in the future,
so that the init timestep can use multiple threads while the restart timestep uses only one thread.
6 lines of code changed in 2 files:
fix hash map namespace issue
6 lines of code changed in 3 files:
Removed many of the datawarehouse locks by using atomic operations
401 lines of code changed in 19 files:
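As an illustration of the lock-to-atomic idea in this commit, here is a minimal sketch, not the actual datawarehouse code. The `UseCounter` type and its `release()` method are hypothetical names; the sketch assumes a per-variable counter that was previously decremented under a lock and shows how `std::atomic` can provide the same "exactly one caller sees the last use" guarantee without one.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical stand-in for a per-variable use counter in a data warehouse.
// With a mutex, every release would serialize on the lock; with an atomic
// read-modify-write, threads only contend on the single counter word.
struct UseCounter {
  std::atomic<int> remaining;

  explicit UseCounter(int uses) : remaining(uses) {}

  // Consume one use. Returns true for exactly one caller: the one whose
  // decrement took the counter from 1 to 0. That caller may free the data.
  bool release() {
    int before = remaining.fetch_sub(1, std::memory_order_acq_rel);
    assert(before > 0);  // releasing more often than registered is a bug
    return before == 1;
  }
};
```

Because `fetch_sub` returns the value held before the decrement, the "am I last?" check and the decrement happen as one indivisible step, which is what the lock used to guarantee.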
add computesWithScratchGhost for Wasatch
37 lines of code changed in 2 files:
better scheduling of the copydata timestep
32 lines of code changed in 1 file:
compute refineset in parallel
32 lines of code changed in 1 file:
fixed a mismatch between task computes and dw put.
The task computes a reduction var into the null (global) level, but puts it into level 0.
1 line of code changed in 1 file:
fix memory leak of postD2H and postH2D copies.
4 lines of code changed in 1 file:
Loop through only neighbours instead of all processors when assigning resources/nodes to a ONCE_PER_PROC task.
This change sped up the assign resource method by 6900X on Mira with 256K cores.
6 lines of code changed in 1 file:
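The neighbour-only loop can be sketched as follows. This is a hedged illustration under stated assumptions, not the scheduler's real code: `ProcTask` and `assign_once_per_proc` are hypothetical names, and the sketch assumes each rank already knows the set of ranks it communicates with.

```cpp
#include <cassert>
#include <set>
#include <vector>

// Hypothetical record: one per-processor task instance bound to a rank.
struct ProcTask {
  int rank;
};

// Create task instances only for this rank and its neighbours.
// Work is proportional to the neighbourhood size, not to the total
// number of processors in the run.
std::vector<ProcTask> assign_once_per_proc(int my_rank,
                                           const std::set<int>& neighbours) {
  assert(my_rank >= 0);
  std::set<int> ranks(neighbours);
  ranks.insert(my_rank);  // a rank always handles its own instance

  std::vector<ProcTask> assigned;
  assigned.reserve(ranks.size());
  for (int r : ranks) {
    assigned.push_back(ProcTask{r});
  }
  return assigned;
}
```

With a loop over all processors, the same assignment would cost one iteration per rank in the whole run, even though most ranks never interact with the local one.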
When more than one task computes the same reduction variable into the same level,
schedule only one reduction task, after the very last task that computes this reduction variable.
28 lines of code changed in 3 files:
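The rule described in this commit can be sketched as below. It is a simplified illustration, assuming tasks are already in execution order; `Task`, `Key`, `last_computer`, and `schedule_with_reductions` are hypothetical names and do not come from the scheduler source.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <utility>
#include <vector>

// (reduction variable name, level index) identifies one reduction target.
using Key = std::pair<std::string, int>;

// Hypothetical task record: a name plus the reduction targets it computes.
struct Task {
  std::string name;
  std::vector<Key> reductions;
};

// For each reduction target, find the index of the last task computing it.
std::map<Key, std::size_t> last_computer(const std::vector<Task>& tasks) {
  std::map<Key, std::size_t> last;
  for (std::size_t i = 0; i < tasks.size(); ++i) {
    for (const Key& k : tasks[i].reductions) {
      last[k] = i;  // later tasks overwrite earlier entries
    }
  }
  return last;
}

// Emit the task names in order, inserting exactly one reduction entry per
// target, directly after the last task that computes that target.
std::vector<std::string> schedule_with_reductions(
    const std::vector<Task>& tasks) {
  const std::map<Key, std::size_t> last = last_computer(tasks);
  std::vector<std::string> order;
  for (std::size_t i = 0; i < tasks.size(); ++i) {
    order.push_back(tasks[i].name);
    for (const auto& entry : last) {
      if (entry.second == i) {
        order.push_back("reduce:" + entry.first.first + "@" +
                        std::to_string(entry.first.second));
      }
    }
  }
  return order;
}
```

Keying on the (variable, level) pair means two tasks that compute the same variable into different levels still get separate reduction entries, while repeated computes into the same level collapse to one.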
Turn off topologicalSort() during task graph compilation.
Move the reduction task generation part of the topo sort to SchedulerCommon.
This greatly sped up taskgraph compilation at very large scale.
43 lines of code changed in 2 files: