The hpcnotes HPC blog - supercomputing, HPC, high performance computing, cloud, e-infrastructure, scientific computing, exascale, parallel programming services, software, big data, multicore, manycore, Phi, GPU, HPC events, opinion, ...
Friday, 8 November 2019
Guide to announcements for SC19
(Originally published on my LinkedIn profile: post link)
It's that time of year - yes, the annual fest of press releases and social media deluges in the run up to 'SC' - the primary annual supercomputing conference, held this year in Denver.
Here is a handy guide for vendor PR teams ...
[company] will be at #SC19!
Yes, along with almost everyone else in #HPC world
[company] will be highlighting products at SC19!
As above
[company] will launch new version of our current product in a slightly different shade of grey at SC19!
We had no actual news
However, the HPC centers are just as bad with "news" for the big annual #Supercomputing conference:
Wednesday, 7 November 2018
SC18 preview
I've written my customary preview of SC, which is now published at HPC Wire: https://www.hpcwire.com/2018/11/06/sc18-preview-big-in-dallas/.
Over 10,000 members of the global HPC community will gather in Dallas for the SC18 conference. Even a decent sized team will struggle to attend everything the official program has to offer. On top of this, there will be a plethora of public and private meetings outside the official program, many of which are more valuable than the official program. Plus, there will be the usual flood of press releases, social media blasts, etc.
Out of all of this, what will emerge as the key themes? What are some essential things to do/attend? Read the @hpcnotes SC18 preview to find out!
Over 10,000 members of the global HPC community will gather in Dallas for the SC18 conference. Even a decent sized team will struggle to attend everything the official program has to offer. On top of this, there will be a plethora of public and private meetings outside the official program, many of which are more valuable than the official program. Plus, there will be the usual flood of press releases, social media blasts, etc.
Out of all of this, what will emerge as the key themes? What are some essential things to do/attend? Read the @hpcnotes SC18 preview to find out!
Tuesday, 6 November 2018
SC18 Networking Receptions
Networking Receptions at SC18 Dallas [updated regularly until SC starts]
A huge part of the SC conference (or any HPC conference) is meeting people - from old friends to new contacts. Here is a curated list of networking opportunities (receptions) crowd-sourced from this twitter thread https://twitter.com/hpcnotes/status/1059437643837161474 and other sources:
Sunday 11th
- Spectrum Scale UG and happy hour, 12-4pm
- LLVM & Flang social, 6pm-9pm, Aloft, http://lists.llvm.org/pipermail/llvm-dev/2018-October/127303.html
- Intersect360, 2pm-4pm, Biergarten Restaurant, https://www.eventbrite.com/e/intersect360-research-sc18-networking-reception-tickets-51318644447
- DDN user group reception, 5pm, Old Red Museum of Dallas County History & Culture, following user group meeting https://www.ddn.com/company/events/user-group-sc/
- SC official opening gala, 7pm-9pm, https://sc18.supercomputing.org/presentation/?id=pec122&sess=sess284
- Beowulf Bash, 9pm, Eddie Dean's Ranch, https://beowulfbash.com/
- Women in HPC, 6.30pm-9pm, Cafe Herrera on Lamar (at the Omni Dallas), https://www.eventbrite.co.uk/e/whpcsc18-networking-and-careers-reception-tickets-51886141847
- IBM
- Cray
- DDN, 7pm, Reunion Tower, tickets via DDN booth #3213
- Nimbix, 6pm-10pm, d.e.c. on dragon st, https://www.eventbrite.com/e/nimbix-sc18-lounge-party-registration-50540352555
- OpenACC, 7pm-10pm, Eddie Dean's Ranch, https://www.eventbrite.com/e/9th-openacc-user-group-meeting-sc2018-tickets-50250658071
- Mellanox, 6.30pm, Sheraton, http://www.mellanox.com/sc18/event.php?ls=social-tw&lsd=11.5.18
- Boston Limited, 7.30pm Bob's Steak & Chop House, https://www.boston.co.uk/events/2018/sc18.aspx [pre-booking required via booth #3255]
Tweet me @hpcnotes using hashtag #SC18 to add your reception to this list!
Monday, 5 November 2018
SC18 Tutorials
At SC18, I will be leading two tutorials, along with my long-time co-presenter Owen Thomas and new co-presenter for SC18, Ingrid Barcena Roig.
- 8:30am - noon : "The Business of HPC: TCO, value, metrics, and more ..."
- 1.30pm - 5.00pm : "Procurement and Commissioning of HPC Systems"
Both tutorials are on Monday 12th November, in room C140 of the Dallas convention center.
Saturday, 23 June 2018
A useful reading list for travelling to ISC18
Travelling to Frankfurt for ISC? Need to feed your HPC thirst while on planes, trains, or in hotel rooms? Here is my pick of things to download and read so that you are fully informed when you start ISC:
- (Obviously!) The @hpcnotes ISC18 preview at HPC Wire
- The official welcome to ISC18 by the organizers, via InsideHPC
- An article on the race between USA and China (and others) to get to the top of the Top500 at WiredUK "Why the US and China's brutal supercomputer war matters" and my follow up thoughts: https://www.hpcnotes.com/2018/06/does-it-matter-whether-usa-china-eu-or.html
- Sandia to 2.3 deploy petaflops Cavium TX2 ARM supercomputer at Top500.org
- Details on Japan's post-K exascale supercomputer from The Next Platform
- [added Sat afternoon] Top500 interviews Thomas Sterling on Exascale, Chinese HPC, ML and non-von-Neumann
- [more to follow during the weekend ...
See you in Frankfurt!
Andrew / @hpcnotes
Wednesday, 20 June 2018
NAG-TACC HPC Leadership Institute 2018
Just taken over a HPC management or leadership role? Or hoping to soon? Or know someone who could grow into those roles? Or been a HPC director for years but value ongoing personal development?
The HPC Leadership Institute is a partnership between Numerical Algorithms Group (NAG) and Texas Advanced Computing Center (TACC) to deliver training on the business aspects of High Performance Computing. The training covers strategy, total cost of ownership (TCO), cloud vs on-site, supercomputer procurement, governance, user services, and much more.
The 2018 course will be held in Austin TX September 11-13. Learn more and register now at:
https://www.tacc.utexas.edu/education/institutes/hpc-leadership-institute
The HPC Leadership Institute is a partnership between Numerical Algorithms Group (NAG) and Texas Advanced Computing Center (TACC) to deliver training on the business aspects of High Performance Computing. The training covers strategy, total cost of ownership (TCO), cloud vs on-site, supercomputer procurement, governance, user services, and much more.
The 2018 course will be held in Austin TX September 11-13. Learn more and register now at:
https://www.tacc.utexas.edu/education/institutes/hpc-leadership-institute
Does it matter whether USA, China, EU, or someone else has the biggest supercomputer?
Much fuss will be made over the ORNL's new Summit supercomputer at the ISC18 event next week - in particular the fact that it means the USA replaces China as the home of world's fastest supercomputer according to www.top500.org. This brings the usual question as to whether it really matters which country has the biggest supercomputer.
Having a supercomputer 20%, or even 2x, faster than a competitor isn’t critical on its own, because it is possible to make up 20% or 2x actual competitive capability through better software, better people, or better service delivery practices.
However, a 10x faster supercomputer would be an issue, because that would typically reflect a political commitment to High Performance Computing (HPC) involving hardware and software and people - and so could mean potential capability dominance.
Of course, if you had the 2x slower supercomputer without investing in people/software/practices to make up the difference, then that would be a meaningful competitive gap and would matter.
Read more in this article at WiredUK: "Why the US and China's brutal supercomputer war matters"
Having a supercomputer 20%, or even 2x, faster than a competitor isn’t critical on its own, because it is possible to make up 20% or 2x actual competitive capability through better software, better people, or better service delivery practices.
However, a 10x faster supercomputer would be an issue, because that would typically reflect a political commitment to High Performance Computing (HPC) involving hardware and software and people - and so could mean potential capability dominance.
Of course, if you had the 2x slower supercomputer without investing in people/software/practices to make up the difference, then that would be a meaningful competitive gap and would matter.
Read more in this article at WiredUK: "Why the US and China's brutal supercomputer war matters"
Wednesday, 22 November 2017
Benchmarking HPC systems
At SC17, we celebrated the 50th edition of the Top500 list. With nearly 25,000 list positions published over 25 years, the Top500 is an incredibly rich database of consistently measured performance data with associated system configurations, sites, vendors, etc. Each SC and ISC, the Top500 feeds community gossip, serious debate, the HPC media, and ambitious imaginations of HPC marketing departments. Central to the Top500 list is the infamous HPL benchmark.
Benchmarks are used to answer questions such as (naively posed): “How fast is this supercomputer?”, “How fast is my code?”, “How does my code scale?”, “Which system/processor is faster?”.
In the context of HPC, benchmarking means the collection of quantifiable data on the speed, time, scalability, efficiency, or similar characteristics of a specific combination of hardware, software, configuration, and dataset. In practice, this means running well-understood test case(s) on various HPC platforms/configurations under specified conditions or rules (for consistency) and recording appropriate data (e.g., time to completion).
These test cases may be full application codes, or subsets of those codes with representative performance behaviour, or standard benchmarks. HPL falls into the latter category, although for some applications it could fall into the second category too. In fact, this is the heart of the debate over the continued relevance of the HPL benchmark for building the Top500 list: how many real-world applications does it provide a meaningful performance guide for? But, even moving away from HPL to “user codes”, selecting a set of benchmark codes is as much a political choice (e.g., reflecting stakeholders) as it is a technical choice.
Benchmarks are used to answer questions such as (naively posed): “How fast is this supercomputer?”, “How fast is my code?”, “How does my code scale?”, “Which system/processor is faster?”.
In the context of HPC, benchmarking means the collection of quantifiable data on the speed, time, scalability, efficiency, or similar characteristics of a specific combination of hardware, software, configuration, and dataset. In practice, this means running well-understood test case(s) on various HPC platforms/configurations under specified conditions or rules (for consistency) and recording appropriate data (e.g., time to completion).
These test cases may be full application codes, or subsets of those codes with representative performance behaviour, or standard benchmarks. HPL falls into the latter category, although for some applications it could fall into the second category too. In fact, this is the heart of the debate over the continued relevance of the HPL benchmark for building the Top500 list: how many real-world applications does it provide a meaningful performance guide for? But, even moving away from HPL to “user codes”, selecting a set of benchmark codes is as much a political choice (e.g., reflecting stakeholders) as it is a technical choice.
Labels:
benchmarking,
hpc,
hpl,
sc17,
top500
Friday, 29 September 2017
Finding a Competitive Advantage with High Performance Computing
High Performance Computing (HPC), or supercomputing, is a critical enabling capability for many industries, including energy, aerospace, automotive, manufacturing, and more. However, one of the most important aspects of HPC is that HPC is not only an enabler, it is often also a differentiator – a fundamental means of gaining a competitive advantage.
Differentiating (gaining a competitive advantage) through HPC can include:
Strategic delivery choices are the broad decisions about how to do/use HPC within an organization. This might include:
A key means of differentiating with HPC, and one of the most visible, is through the choice of hardware technologies used and at what scale. The HPC market is currently enjoying (or is it suffering?) a broader range of credible hardware technology options than the previous few years.
Differentiating with HPC
Differentiating (gaining a competitive advantage) through HPC can include:
- faster - complete calculations in a shorter time;
- more - complete more computations in a given amount of time;
- better - undertake more complex computations;
- cheaper - deliver computations at a lower cost;
- confidence - increase the confidence in the results of the computations; and
- impact - effectively exploiting the results of the computations in the business.
Strategic delivery choices are the broad decisions about how to do/use HPC within an organization. This might include:
- choosing between cloud computing and traditional in-house HPC systems (or points on a spectrum between these two extremes);
- selecting between a cost-driven hardware philosophy and a capability-driven hardware philosophy;
- deciding on a balance of internal capability and externally acquired capability;
- choices on the balance of investment across hardware, software, people and processes.
Which HPC technology?
A key means of differentiating with HPC, and one of the most visible, is through the choice of hardware technologies used and at what scale. The HPC market is currently enjoying (or is it suffering?) a broader range of credible hardware technology options than the previous few years.
Labels:
cloud,
competitive advantage,
consulting,
differentiation,
gpu,
hpc,
knl,
NAG,
phi,
skl,
supercomputing,
technology diversity
Monday, 31 July 2017
HPC Getting More Choices - Technology Diversity
HPC has been easy for a while ...
When buying new workstations or personal computers, it is easy to adopt the simple mantra that a newer processor or higher clock frequency means your application will run faster. It is not totally true, but it works well enough. However, with High Performance Computing, HPC, it is more complicated.
HPC works by using parallel computing – the use of many computing elements together. The nature of these computing elements, how they are combined, the hardware and software ecosystems around them, and the challenges for the programmer and user vary significantly – between products and across time. Since HPC works by bringing together many technology elements, the interaction between those elements becomes as important as the elements themselves.
Whilst there has always been a variety of HPC technology solutions, there has been a strong degree of technical similarity of the majority of HPC systems in the last decade or so. This has meant that (i) code portability between platforms has been relatively easy to achieve and (ii) attention to on-node memory bandwidth (including cache optimization) and inter-node scaling aspects would get you a long way towards a single code base that performs well on many platforms.
Increase in HPC technology diversity
However, there is a marked trend of an increase in diversity of technology options over the last few years, with all signs that this is set to continue for the next few years. This includes breaking the near-ubiquity of Intel Xeon processors, the use of many-core processors for the compute elements, increasing complexity (and choice) of the data storage (memory) and movement (interconnect) hierarchies of HPC systems, new choices in software layers, new processor architectures, etc.
This means that unless your code is adjusted to effectively exploit the architecture of your HPC system, your code may not run faster at all on the newer system.
It also means HPC clusters proving themselves where custom supercomputers might have previously been the only option, and custom supercomputers delivering value where commodity clusters might have previously been the default.
Labels:
cpu,
gpu,
hpc,
manycore,
memory,
software,
supercomputers,
supercomputing,
technology diversity
Tuesday, 11 July 2017
SC17 Tutorials - HPC cost models, investment cases and acquisitions
Following our successful HPC tutorials at SC16 and OGHPC17, I'm delighted to report that we've had three tutorials accepted for SC17 in Denver this November, all continuing our mission to provide HPC training opportunities for HPC people other than just programmers.
At SC17, we will be delivering these three tutorials:
The HPC procurement tutorial was successfully presented at SC13 (>100 attendees) and SC16 (~60 attendees). Feedback from the SC16 attendees was very positive: scored 4.6/5 overall and scored 2.9/3 for “recommend to a colleague”.
The HPC finance tutorial was successfully presented at SC17 (~60 attendees) and at the Rice Oil & Gas HPC conference 2017 (~30 attendees). Feedback from the SC16 attendees was very positive: scored 4.3/5 overall and scored 2.7/3 for “recommend to a colleague”.
The HPC business case tutorial is new for SC17.
The tutorials provide an impartial, practical, non-sales focused guide to the business aspects of HPC facilities and services (including cloud), such as total cost of ownership, funding models, showing value and securing investing in HPC, and the process of purchasing and deploying a HPC system. All tutorials include exploration of the main issues, pros and cons of differing approaches, practical tips, hard-earned experience and potential pitfalls.
Essential HPC Finance Practice: Total Cost of Ownership (TCO), Internal Funding, and Cost-Recovery Models
Me (Andrew Jones, @hpcnotes), Owen Thomas (Red Oak Consulting), and Terry Hewitt. We have been involved in numerous major HPC procurements and other strategic HPC projects since 1990, as service managers, bidders to funding agencies, as customers and as impartial advisors. We are all from the UK but have worked around the world and the tutorials will be applicable to HPC projects and procurements anywhere. The tutorials are based on experiences across a diverse set of real world cases in various countries, in private and public sectors.
These SC17 tutorials will deliver a lot of content in each half day. However, if you need more depth, or a fuller range of topics, or are looking for a CV step towards becoming a future HPC manager, then our joint TACC-NAG summer training institute is the right thing for you: "Where will future HPC leaders come from?"
Hope to see you at one (or more!) of our tutorials at SC17 this November in Denver.
@hpcnotes
At SC17, we will be delivering these three tutorials:
- [Sun 12th, am] "Essential HPC Finance: Total Cost of Ownership (TCO), Internal Funding, and Cost-Recovery Models"
- [Sun 12th, pm] "Extracting Value from HPC: Business Cases, Planning, and Investment"
- [Mon 13th, am] "HPC Acquisition and Commissioning"
Are these tutorials any good?
The HPC procurement tutorial was successfully presented at SC13 (>100 attendees) and SC16 (~60 attendees). Feedback from the SC16 attendees was very positive: scored 4.6/5 overall and scored 2.9/3 for “recommend to a colleague”.
The HPC finance tutorial was successfully presented at SC17 (~60 attendees) and at the Rice Oil & Gas HPC conference 2017 (~30 attendees). Feedback from the SC16 attendees was very positive: scored 4.3/5 overall and scored 2.7/3 for “recommend to a colleague”.
The HPC business case tutorial is new for SC17.
What is the goal of the tutorials?
The tutorials provide an impartial, practical, non-sales focused guide to the business aspects of HPC facilities and services (including cloud), such as total cost of ownership, funding models, showing value and securing investing in HPC, and the process of purchasing and deploying a HPC system. All tutorials include exploration of the main issues, pros and cons of differing approaches, practical tips, hard-earned experience and potential pitfalls.
What is in the tutorials?
Essential HPC Finance Practice: Total Cost of Ownership (TCO), Internal Funding, and Cost-Recovery Models
- Calculating and using TCO models
- Pros and cons of different internal cost recovery and funding models
- Updated from the SC16 base, with increased consideration of cloud vs in-house HPC
- Applicable to either a first investment or an upgrade of existing capability
- Most relevant to organizations with a clear purpose (e.g., industry) or those with a clear service mission (e.g., academic HPC facilities)
- Identifying the value, building a business case, engaging stakeholders, securing funding, requirements capture, market survey, strategic choices, and more
- Procurement process including RFP
- Specify what you want, yet enable the suppliers to provide innovative solutions beyond the specification both in technology and in the price
- Bid evaluation, benchmarks, clarification processes
- Demonstrate to stakeholders that the solution selected is best value for money
- Contracting, project management, commissioning, acceptance testing
Who are the tutors?
Me (Andrew Jones, @hpcnotes), Owen Thomas (Red Oak Consulting), and Terry Hewitt. We have been involved in numerous major HPC procurements and other strategic HPC projects since 1990, as service managers, bidders to funding agencies, as customers and as impartial advisors. We are all from the UK but have worked around the world and the tutorials will be applicable to HPC projects and procurements anywhere. The tutorials are based on experiences across a diverse set of real world cases in various countries, in private and public sectors.
What if you need even more depth?
These SC17 tutorials will deliver a lot of content in each half day. However, if you need more depth, or a fuller range of topics, or are looking for a CV step towards becoming a future HPC manager, then our joint TACC-NAG summer training institute is the right thing for you: "Where will future HPC leaders come from?"
Hope to see you at one (or more!) of our tutorials at SC17 this November in Denver.
@hpcnotes
Labels:
business case funding,
hpc,
procurement,
SC16,
sc17,
supercomputer,
tco,
tutorials
Subscribe to:
Posts (Atom)