...
- Cleanup of the Arm builds, simplifying the split between the generic ARMv8 target and the core-specific targets, and adding support for more cores
- Performance improvements across the board, and a guarantee that the ARMv8 target only contains ARMv8.0 code (previously it could include v8.1 code)
- Tested on Synquacer (A53), D03 (A57), D05 (A72), ThunderX, Amberwing (Falkor), ThunderX2, Moonshot (XGene)
- Pull request: https://github.com/xianyi/OpenBLAS/pull/1876
- Would be good for Fujitsu to test that code on Post-K
- ThunderX2 builds might actually be good for Post-K (larger caches)
- Will need to add march=armv8.2+sve (in Makefile.arm64) to see SVE code coming out
- We can later add Post-K mode when cpuinfo/cache/TLB details are public
OpenHPC
- Mellanox code rebooting nodes on non-SMS machines
- Will send pull request to master later
- Working on Baptiste's code in a new branch inside Fujitsu
- @Baptiste to add a branch with all the patches for Fujitsu
- Testing IB changes (MOFED) in HPC Lab, working so far
LLVM
- Completed moving work into Linaro's git
- New branch for regalloc, with initial support for control flow but not yet for split and spill
- Found JumpThreading bugs, fixed
- Created random testing for branch elimination, will run next week
- Some new basic blocks added, need to check that the vectoriser still recognises all patterns
Infrastructure tasks
- Tried to PXE boot ThunderX2, changing parameters in the BIOS and the provisioner; will try different ports next week
- Tried to upgrade Amberwing's firmware, but getting unknown failures; in contact with Qualcomm
...