Bi-Weekly HPC Engineering Sync Minutes
[ Attendance ] [ Dial in Information ] [ Agenda ] [ Minutes ]
2020-01-09
Attendance
Engineering Members
Name | Present |
|---|---|
Paul Isaac's (HPC Tech Lead, Linaro) | |
Baptiste Gerondeau (HPC Engineer, Linaro) | |
Masakazu Ueno (Fujitsu) | |
Masakai Arai (Fujitsu) | |
Nakashima Kouta (Fujitsu) |
Not present
Present
Optional / Guests
Name | Present |
|---|---|
Mark Orvek (VP Engineering, Linaro) | |
Elsie Wahlig (Sr Director LDCG, Linaro) | |
Graeme Gregory (LDCG Engineering Mgr, Linaro) | |
Victor Duan (Japan Country Mgr, Linaro) | |
Jammy Zhou (China Country Mgr, Linaro) | |
|
|
Dial in Information
Paul Isaac's is inviting you to a scheduled Zoom meeting.
Topic: 2020-01-09 HPC Engineering Meeting Agenda/Minutes
Join Zoom Meeting
https://zoom.us/j/2990402863
One tap mobile
+16465588656,,2990402863# US (New York)
+17207072699,,2990402863# US
Dial by your location
+1 646 558 8656 US (New York)
+1 720 707 2699 US
+1 877 853 5247 US Toll-free
+1 888 788 0099 US Toll-free
Meeting ID: 611 276 1834
Find your local number: https://zoom.us/u/axpe6BG2s
Location | Local Time | Time Zone | UTC Offset |
|---|---|---|---|
San Jose (USA - California) | Thursday, November 14, 2019 at 6:00:00 am | UTC-7 hours | |
London (United Kingdom - England) | Thursday, November 14, 2019 at 1:00:00 pm | UTC+0 hours | |
Paris (France - Île-de-France) | Thursday, November 14, 2019 at 2:00:00 pm | UTC+1 hour | |
Tokyo (Japan) | Thursday, November 14, 2019 at 10:00:00 pm | UTC+9 hours | |
Corresponding UTC (GMT) |
|
Agenda
Previous meeting notes:
Topic
Previous format/topics
Paul - Colo update
Baptiste - Tensorflow script. Lab Updates
Masaki Arai (Fujitsu) - Compiler Updates
AOB
Next Meetings:
Meeting 28 November 2019
SC'19 November 16-22 2019
Minutes (DRAFT - please add relevant links to your topics)
Previous meeting notes:
Topic
Colo Lab Updates - Second power connections added to most nodes. Additional nodes added to switch used for Warewulf builds (shared with openHPC test nodes).
Tensorflow script has been documented - comments appreciated - Building TensorFlow on AArch64
Problems with NumPy building. Bug appears in OpenHPC and RedHat. Perhaps consider fetching latest toolchain builds from Linaro Toolchain team. GCC bug continues to be a problem.
Compiler Updates - Open
LLVM updates continuing for A64FX by Fujitsu under NDA.
GCC updates for A64FX - there are currently no Fujitsu resources. Non-NDA resources cannot be used until the A64FX technical specification is made available.
A64FX specification 'may' be released 1Q2020.
AOB
Next Meetings:
Meeting 28 November 2019
SC'19 November 16-22 2019
Recording Link
2019-11-14
Attendance
Engineering Members
Name | Present |
|---|---|
Paul Isaac's (HPC Tech Lead, Linaro) | |
Baptiste Gerondeau (HPC Engineer, Linaro) | |
Masakazu Ueno (Fujitsu) | |
Masakai Arai (Fujitsu) | |
|
|
Not present
Present
Optional / Guests
Name | Present |
|---|---|
Mark Orvek (VP Engineering, Linaro) | |
Elsie Wahlig (Sr Director LDCG, Linaro) | |
Graeme Gregory (LDCG Engineering Mgr, Linaro) | |
Victor Duan (Japan Country Mgr, Linaro) | |
Jammy Zhou (China Country Mgr, Linaro) | |
|
|
Dial in Information
Paul Isaac's is inviting you to a scheduled Zoom meeting.
Topic: 2019-11-14 HPC Engineering Meeting Agenda/Minutes
Join Zoom Meeting
https://zoom.us/j/2990402863
One tap mobile
+16465588656,,2990402863# US (New York)
+17207072699,,2990402863# US
Dial by your location
+1 646 558 8656 US (New York)
+1 720 707 2699 US
+1 877 853 5247 US Toll-free
+1 888 788 0099 US Toll-free
Meeting ID: 611 276 1834
Find your local number: https://zoom.us/u/axpe6BG2s
Location | Local Time | Time Zone | UTC Offset |
|---|---|---|---|
San Jose (USA - California) | Thursday, November 14, 2019 at 6:00:00 am | UTC-7 hours | |
London (United Kingdom - England) | Thursday, November 14, 2019 at 1:00:00 pm | UTC+0 hours | |
Paris (France - Île-de-France) | Thursday, November 14, 2019 at 2:00:00 pm | UTC+1 hour | |
Tokyo (Japan) | Thursday, November 14, 2019 at 10:00:00 pm | UTC+9 hours | |
Corresponding UTC (GMT) |
|
Agenda
Previous meeting notes:
Topic
Previous format/topics
Paul - Colo update
Baptiste - Tensorflow script. Lab Updates
Masaki Arai (Fujitsu) - Compiler Updates
AOB
Next Meetings:
Meeting 28 November 2019
SC'19 November 16-22 2019
Minutes (DRAFT - please add relevant links to your topics)
Previous meeting notes:
Topic
Colo Lab Updates - Second power connections added to most nodes. Additional nodes added to switch used for Warewulf builds (shared with openHPC test nodes).
Tensorflow script has been documented - comments appreciated - Building TensorFlow on AArch64
Problems with NumPy building. Bug appears in OpenHPC and RedHat. Perhaps consider fetching latest toolchain builds from Linaro Toolchain team. GCC bug continues to be a problem.
Compiler Updates - Open
LLVM updates continuing for A64FX by Fujitsu under NDA.
GCC updates for A64FX - there are currently no Fujitsu resources. Non-NDA resources cannot be used until the A64FX technical specification is made available.
A64FX specification 'may' be released 1Q2020.
AOB
Next Meetings:
Meeting 28 November 2019
SC'19 November 16-22 2019
Recording Link
2019-10-31
Attendance
Engineering Members
Name | Present |
|---|---|
Paul Isaac's (HPC Tech Lead, Linaro) | |
Baptiste Gerondeau (HPC Engineer, Linaro) | |
Masakazu Ueno (Fujitsu) | |
Masakai Arai (Fujitsu) | |
|
|
Not present
Present
Optional / Guests
Name | Present |
|---|---|
Mark Orvek (VP Engineering, Linaro) | |
Elsie Wahlig (Sr Director LDCG, Linaro) | |
Graeme Gregory (LDCG Engineering Mgr, Linaro) | |
Victor Duan (Japan Country Mgr, Linaro) | |
Jammy Zhou (China Country Mgr, Linaro) | |
|
|
Dial in Information
Paul Isaac's is inviting you to a scheduled Zoom meeting.
Topic: 2019-10-31 HPC Engineering Meeting Agenda/Minutes
Join Zoom Meeting
https://zoom.us/j/2990402863
One tap mobile
+16465588656,,2990402863# US (New York)
+17207072699,,2990402863# US
Dial by your location
+1 646 558 8656 US (New York)
+1 720 707 2699 US
+1 877 853 5247 US Toll-free
+1 888 788 0099 US Toll-free
Meeting ID: 611 276 1834
Find your local number: https://zoom.us/u/axpe6BG2s
Location | Local Time | Time Zone | UTC Offset |
|---|---|---|---|
San Jose (USA - California) | Thursday, October 31, 2019 at 6:00:00 am | UTC-7 hours | |
London (United Kingdom - England) | Thursday, October 31, 2019 at 1:00:00 pm | UTC+0 hours | |
Paris (France - Île-de-France) | Thursday, October 31, 2019 at 2:00:00 pm | UTC+1 hour | |
Tokyo (Japan) | Thursday, October 31, 2019 at 10:00:00 pm | UTC+9 hours | |
Corresponding UTC (GMT) |
|
Agenda
Previous meeting notes:
Topic
Previous format/topics
Paul - Workstation config
Dev boards
Jinshui/Peter Lui (Futurewei) -
Back to Aug/Sept when Joshua Terry as the Linaro HPC-SIG technical lead, Peter discussed with him about the ARM HPC system integration and testing environment options. One of the options is Futurewei may be able to host an ARM64 cluster (for example, 16-32 2-socket nodes). But then Joshua left Linaro and the discussion paused.
Baptiste - Tensorflow script. Lab Updates
Masaki Arai (Fujitsu) - Compiler Updates
AOB
Next Meetings:
Meeting 14 November 2019
SC'19 November 16-22 2019
Minutes
Previous meeting notes:
Topic
Windows 10 Workstation config:
Qemu start for an emulated Aarch64/Ubuntu environment
"C:\Program Files\qemu"\qemu-system-aarch64 -m 8192 -cpu cortex-a57 -smp 4 -M virt -nographic -drive file=aarch64_flash0.img,format=raw,if=pflash -drive file=aarch64_flash1.img,format=raw,if=pflash -drive if=none,file=eoan-server-cloudimg-arm64.img,id=hd0 -device virtio-blk-device,drive=hd0 -drive if=none,file=cloud.img,id=hd1 -device virtio-blk-device,drive=hd1 -netdev tap,ifname=EthernetTAP,id=network01 -device e1000,netdev=network01,mac=52:54:00:12:34:56 -accel tcg,thread=multi
However, network connection not yet working. Comments appreciated.
Baptiste Comment: To be honest, going through Windows is a big added complexity to the setup. I would recommend installing an Ubuntu/Debian dual-boot and running QEMU (+ libvirt) from there if network problems persist. CUDA/GPU support should be solid on Ubuntu.
Paul - Dual boot is not currently an option as the only graphics card is the single Nvidia GPU (not paired with Intel embedded/Nvidia external) which causes lock-up. When nographics then Ubuntu still crashes complaining of ACPI issues. When ACPI turned off no storage is recognised. Therefore, qemu on Windows is the only option to emulate Aarch64 in this hardware (currently).
Lab Updates - hardware changes at Colo Nov.11/12 2019. Currently no known facility to host water-cooled nodes.
Tensorflow script has been documented - comments appreciated - Building TensorFlow on AArch64
Problems with NumPy building. Bug appears in OpenHPC and RedHat. Perhaps consider fetching latest toolchain builds from Linaro Toolchain team.
Futurewei hosting - How does affect 'EAR' restrictions for technology transfer if we login to a Futurewei hosted system?
Compiler Updates - Open
LLVM updates continuing for A64FX by Fujitsu under NDA.
GCC updates for A64FX - there are currently no Fujitsu resources. Non-NDA resources cannot be used until the A64FX technical specification is made available.
A64FX specification 'may' be released 1Q2020.
AOB
Next Meetings:
Meeting 14 November 2019
SC'19 November 16-22 2019
Recording Link
2019-06-05
Attending
Baptiste
Graeme
Elsie