Table of Contents | ||
---|---|---|
|
Attendance
Committee Members
Name | Present |
---|---|
Kanta Vekaria (OCTO, Linaro) | |
Martin Stadtler (Director of LEG, Linaro) | |
Darren Cepulis (ARM) |
|
Eric Van Hensbergen (ARM) | |
Kangkang Shen (HiSilicon) |
|
Bhalchandra Deshpande(Broadcom) |
|
Larry Wikelius (Cavium) | |
Gary Yurcak (Qualcomm) |
|
Rammohan Peddibhotla (Qualcomm) |
|
Grant Likely (HPE) |
|
Jon Masters (RedHat) |
|
Elsie Wahlig (Qualcomm) |
|
Steve Heist (Qualcomm) | |
Takeharu Kato (Fujitsu) |
Guests
Name | Present |
---|---|
David Rusling (CTO, Linaro) | |
Mark Orvek (EVP, Linaro Engineering) | |
Andrea Gallo (VP of Segment Groups, Linaro) |
|
Anoop Saxena (Project Manager, Segments) |
|
Francoise Ozog (Director of LNG, Linaro) |
|
Leif |
|
Ard |
|
Steve Capper | |
Tom Gall |
Agenda
Previous Actions
View file name HPC SIG NUMA & MicroArch.pdf height 250
- Discussion on Numa for HPC (Leif, Graeme, Ard)
- Discussion on microarchitecture detection (Leif, Graeme, Ard)
- Agree on the time that this meeting should take place
- HPC collaborate pages (Kanta)
- Next meeting 22 Nov agenda
- Engineering status
- post SC16 feedback
Minutes
Previous Actions
Martin: No update yet
Kanta: Pitch deck for TAB for SC16
Discussion on Numa for HPC (Leif, Graeme, Ard)
Leif discussing: NUMA basic exists, if we want to have proper NUMA we need to do more work. Leif suggests that if we want this to happen, we need to get the ARM teams doing scheduling and power scheduling.
There is a pending review for (Steve can you add that proposal ASWGxxxxx)
Steve: Focussing on topology
Discussion on microarchitecture detection (Leif, Graeme, Ard)
Benchmarking
Link time optimisation needs reliable information from the kernel
How many variants?
12 right now could be higher
How much of the software will benefit?
Can target a certain number of libraries such as BLAS libraries
Explore dynamically determining heterogeneous environments
E.g little, medium and big cores needs migration with cores
Training is needed to optimise against a specific arch
Can use 1 set for point training
Elsie: Are we thinking about adding GPGPU as part of the heterogeneous arch?
Eric: was only focusing on cores but agrees it’s wider
Grant: Agrees with Elsie but we need to maintain focus on key areas
Key priorities: HPC workloads working optimally for different microarch. Need to looking in to openCL
Gen-Z interest
Grant: Getting 52 bit address enablement is important. Non trivial amount of work and needed for the distros
Distros enablement
Next steps: Kernel training methodology using BLAS FFTW test cases
Pick a library for self training
Agree on the time that this meeting should take place
No objection from attendees to move this to an hour earlier
HPC collaborate pages (Kanta) - postponed to next meeting
Next meeting 22 Nov agenda
Engineering status
post SC16 feedback