Seventh Annual Texas A&M Research Computing Symposium

Last Update: May 21, 2024

Symposium Details

Dates: May 20-24, 2024
Location: Talks and Workshops - Texas A&M Innovative Learning Classroom Building (ILCB), College Station, TX (map)
Questions? Call us at (979) 458-8414 or email events@hprc.tamu.edu

Texas A&M's High Performance Research Computing is hosting a series of talks and Workshops May 20-24, 2024 to showcase the A&M community’s work in high-performance computing and data-intensive research. There will be an opportunity for students to present their work at the poster session.

New: Future of AI Sessions

Keynote talks:
Bob Crovella, NVIDIA
Trey Tinnell, IBM
Lunch Panel discussion:
Jim Colson, AVP Digital Health, Texas A&M Health Science Center
Kevin Nowka, Professor of Practice, Electrical and Computer Engineering, Texas A&M University
Seth C. Murray, Eugene Butler Endowed Chair in Agricultural Biotechnology, Texas A&M University
Frank Lee, IBM Distinguished Engineer

Keynote talks:

Katie Antypas, Director of the National Science Foundation's (NSF) Office of Advanced Cyberinfrastructure (OAC)

Richard Gerber, Senior Science Advisor at the National Energy Research Scientific Computing Center (NERSC)

Sandra Gesing, Executive Director of the US Research Software Engineer Association (US-RSE)

Banquet and Poster Session with a prize for best poster

Speakers

Prof. Freddie Freddie Witherden, Ocean Engineering, Texas A&M University
Prof. Debjyoti Banerjee, Mechanical Engineering, Texas A&M University
Prof. James C. Sacchettini, Biochemistry and Biophysics, Texas A&M University
Dr. Lars Koesterke, Texas Advanced Computing Center
Prof. Suxia Cui, Electrical and Computer Engineering, Prairie View A&M University
Prof. Prabir Daripa, Mathematics, Texas A&M University
Mahsa Lotfollahi, NVIDIA Solutions Architect - AI
Arianna Martin, HPC Platform Engineer, bp Center for HPC, NAG HPC Services
Prof. Phanourios Tamamis, Chemical Engineering, Texas A&M University
Prof. Paul S. Prueitt, Mathematics, Diné College

Workshops on R, Containers, MATLAB, Deep Learning, CUDA and AlphaFold

Data Explorations with R, Instructor: Dr. Wesley Brashear, Texas A&M High Performance Research Computing
Containers for HPC Reproducibility: Building, Deploying, and Optimizing, Instructor: Dr. Carlos del-Castillo-Negrete, Texas Advanced Computing Center
Charliecloud Container Training, Instructor: Megan Phinney, Los Alamos National Laboratory
Accelerate seismic facies classification using wavelet-informed deep learning, Instructor: Ayon K. Dey, PhD, MathWorks
CUDA Programming: Basic Concepts in C and Fortran, Instructor: Dr. Lars Koesterke, Texas Advanced Computing Center
Fundamentals of Accelerated Computing with CUDA C/C++, NVIDIA DLI Workshop, Instructor: Jian Tao, PhD, Texas A&M University
Introduction to AlphaFold for 3D Protein Structure Prediction

Keynote Talks

Broadening Access to AI Resources through the National AI Research Resource (NAIRR) Pilot

Katie Antypas, Director of the National Science Foundation's (NSF) Office of Advanced Cyberinfrastructure (OAC)

Image of Katie Antypas

Bio: Katie is the Director of the NSF's Office of Advanced Cyberinfrastructure which is responsible for supporting and coordinating the development and deployment of advanced computing and data research infrastructure, tools, services, and training for the research and education community. Prior to joining NSF Katie was at the National Energy Research Scientific Computing (NERSC) Center at Lawrence Berkeley National Laboratory for 17 years in a variety of roles including NERSC Division Deputy, Project Director for NERSC's large scale High Performance Computing system acquisitions, Director of Hardware and Integration of the Exascale Computing Project, Data Department Head and User Services Group Lead. Before coming to NERSC in 2006, Katie worked at the Flash Center at the University of Chicago on the FLASH code, a highly scalable, parallel, adaptive mesh refinement astrophysics application. She has an M.S. in Computer Science from the University of Chicago and a bachelors in Physics from Wellesley College.



Opportunities in High Performance Computing at NERSC and the U.S. Department of Energy

Richard Gerber, Senior Science Advisor at the National Energy Research Scientific Computing Center (NERSC)

Image of Richard Gerber

Abstract: The rapid pace of change in High Performance Computing creates several challenges, but even more so it creates a wealth of new opportunities. In this talk I’ll describe how NERSC and other DOE facilities, along with the Exascale Computing Project, have created new capabilities for researchers who use modeling and simulation, discovery through data, and AI for scientific discovery. I’ll also discuss opportunities for expanding the realm of DOE HPC to engage a broader and more connected community of researchers and cyberinfrastructure professionals.

Bio: Richard Gerber is the Senior Science Advisor at the National Energy Research Scientific Computing Center (NERSC) at Lawrence Berkeley National Laboratory. He also currently serves as Director of Hardware and Integration for the DOE's Exascale Computing Project. Richard has worked with leading-edge HPC systems for more than 30 years, starting with graduate work at the National Center for Supercomputing Applications (NCSA), through a National Research Council Postdoctoral Fellowship at NASA-Ames Research Center, and as a staff member at NERSC since 1996. Richard holds a B.S. in Physics from the University of Florida, and a Ph.D. in Physics from the University of Illinois at Urbana-Champaign.

From 1995 to February 2024, Richard was the NERSC HPC Department Head, managing the User Engagement, Application Performance, Future Technologies, Programming Environment and Models, and Business Operation Services groups. As Senior Science Advisor, he works as a liaison between NERSC and the scientific community and manages the Director's Discretionary Reserve allocations, which are directed to projects that advance strategic goals. In that role he also works closely with program managers in the six Office of Science scientific program offices to help them achieve their advanced scientific computing goals. Richard is on the Steering Committee for the PEARC Conference Series, an active Coalition for Academic Scientific Computation (CASC) member, and part of two DOE post-exascale software sustainability projects.



US-RSE: Empowering Hidden Contributors Driving Science

Sandra Gesing, Executive Director of the US Research Software Engineer Association (US-RSE)

Picture of Sandra Gesing

Abstract: Over the past decade, academia and national labs have increasingly recognized the crucial role of hidden contributors contributing to accelerating science. The acknowledgement is evident in quite some projects. From the founding of 8 Research Software Engineer Associations worldwide to the dedicated efforts of the NSF Center of Excellence for Science Gateways. While it is encouraging that the importance of research software and the people being in this line of work receive more attention, we still have a long road in front of us for well-defined career paths and incentives. A multi-facet approach is needed to meet researchers and educators as well as the hidden contributors where they are. This talk will delve into the crucial role of research software engineers in advancing research and computational activities. Furthermore, It will highlight the importance of fostering a community that encompasses all stakeholders in academia and national labs, advocating for a cultural change and actionable measures on how everyone can contribute to make it happen.

Bio: Sandra Gesing is the inaugural Executive Director of the US Research Software Engineer Association and a Senior Researcher at the San Diego Supercomputer Center. Her research focuses on science gateways, computational workflows as well as distributed and parallel computing. She is especially interested in sustainability of research software, usability of computational methods and reproducibility of research results. Sustainability of research software has many facets and she advocates for improving career paths for research software engineers and facilitators and for incentivizing their work via means beyond the traditional academic rewarding system. Before her positions at US-RSE and SDSC, she was a senior research scientist at the Discovery Partners Institute (DPI), University of Illinois System, Chicago and she was an associate research professor at the University of Notre Dame, Indiana, US. Before she moved to the US, she was a research associate at the University of Edinburgh, UK, and at the University of Tübingen, Germany. Additionally, she has perennial experience as a project manager and system developer in industry in the US and Germany. As head of a system programmer group, she has long-term software projects. She received her Master’s degree in computer science from extramural studies at the FernUniversität Hagen and her PhD in computer science from the University of Tübingen, Germany.

Workshops

Data Explorations with R

Monday, May 20 10:00AM-3:00PM

Instructor: Dr. Wesley Brashear, Texas A&M High Performance Research Computing

This workshop will cover a variety of topics related to conducting data science in R, including basic data structures in R, importing and exporting data, data manipulation with the dplyr package, regression analysis, principal components analysis and data visualization. Attendees will work through interactive coding sessions to achieve the following learning objectives:

Learning Objectives:
  • Understand how data in R is structured in lists, vectors, data frames, and tibbles
  • Understand basic R operators and built-in functions
      Demonstrate how to
    • subset/slice different data structures in R
    • manipulate data sets with the dpylr package
    • import and export data in RStudio
    • conduct linear regression and principal component analyses
  • Generate scatter plots, histograms, and heatmaps with the ggplot2 package
Prerequisites:
  • ACCESS ID
  • Basic understanding of navigating the Linux command line
  • Basic knowledge/skills in any interpreted language (e.g. R, Python) will be beneficial, but not critical

About the Instructor: Dr. Wesley Brashear is an Assistant Research Scientist with High Performance Research Computing at Texas A&M University. His background is in genetics and bioinformatics and has extensive expertise in R.

Containers for HPC Reproducibility: Building, Deploying, and Optimizing

Tuesday, May 21 9:00AM-12:00PM

Instructor: Dr. Carlos del-Castillo-Negrete, Research Associate, Texas Advanced Computing Center

Software containers have gained immense popularity as an effective means for enhancing the portability and reproducibility of high-performance computing (HPC) applications.

This hands-on workshop is designed to provide a comprehensive introduction to building, running, and distributing Docker and Singularity/Apptainer containers specifically tailored for HPC environments. Participants will receive in-depth guidance on creating HPC-compliant containers that are not only suitable for local testing and development but also optimized for deployment on advanced HPC systems.

Presentation Slides: (pdf)

Special attention will be given to the intricacies of building containers that can run on multiple nodes using MPI and using accelerated hardware such as GPUs. Moreover, the workshop will delve into best practices for containerization in HPC contexts, troubleshooting common issues, and strategies for ensuring the seamless integration of containers into existing HPC workflows.



By the end of the session, attendees will be equipped with the knowledge and skills necessary to leverage the full potential of HPC containers, contributing to more efficient, reproducible, and scalable computing solutions.

Prerequisites: ACCESS ID

Learning Objectives:
  • Docker and Singularity/Apptainer basics
  • Containerizing your own code
  • Multi-stage and Multi-architecture Docker builds
  • Running containers on HPC systems, including MPI parallelism and GPU enabled containers

About the presenter: Dr. Carlos del-Castillo-Negrete is a Research Associate in the High Performance Computing Group at the Texas Advanced Computing Center. His research interests center around developing HPC applications for machine learning and uncertainty quantification, particularly in the realm of climate and natural hazards modeling.

CUDA Programming: Basic Concepts in C and Fortran

Tuesday, May 21 9:00AM-12:00PM

Instructor: Dr. Lars Koesterke, Research Associate, Texas Advanced Computing Center

This workshop is designed to teach the fundamentals of CUDA programming for those with a background in programming (C/C++/Fortran) but little to no exposure to CUDA programming.

Presentation Slides: (pdf)

Learning Objectives:
  • Basic CUDA programming concepts: ‘software matching hardware’
  • Performance optimization and techniques beneficial under most circumstances

Prerequisites: Basic knowledge of C/Fortran, or generally of a programming language, ACCESS ID

About the presenter: Dr. Lars Koesterke joined TACC is as a Research Associate in the High Performance Computing group at TACC. Before coming to TACC, he held positions at the Astronomy Department at The University of Texas at Austin, NASA's Goddard Space Flight Center and the Universities of Potsdam and Kiel (both Germany). His work at TACC is focused on performance evaluation and optimization.

Charliecloud Container Training

Tuesday, May 21 1:00PM-4:00PM

Instructor: Megan Phinney, Los Alamos National Laboratory

This workshop will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. We will discuss what containers are, why they matter for HPC, and how they work. We’ll give an overview of Charliecloud, the unprivileged container solution from Los Alamos National Laboratory’s HPC Division. Participants will build toy containers and a real HPC application, and then run them in parallel on a supercomputer. This will be a highly interactive workshop with lots of Q&A.

Presentation Slides: (pdf)

Exercises: (pdf)

About the presenter: Megan Phinney is a computer scientist in the High Performance Computing – Environments group at Los Alamos National Laboratory, where she provides user support for the institutional HPC systems. Her research interests include secure user space container environments and performance of file systems.

Accelerate seismic facies classification using wavelet-informed deep learning

Tuesday, May 21 1:00PM-3:30PM

Hosted by MathWorks

Instructor: Ayon K. Dey, PhD, Senior Application Engineer, MathWorks

Interpretation of geological facies from large seismic datasets is one of the key steps in modeling and simulating subsurface reservoirs to support oil & gas exploration, field development planning, unconventional play development, and carbon storage – just to name a few use cases. MATLAB enables the creation of a fast and robust workflow to classify facies using large seismic datasets by integrating unique geophysical and deep learning methodologies.

Prerequisites: ACCESS ID

Attendees will learn:
  • How to accelerate seismic facies classification by integrating geophysics (multi-resolution analysis or MRA) and deep learning (recurrent neural networks or RNN) while reducing overfitting with unique wavelet signals;
  • How to reduce computing time using GPU computing capabilities on very large seismic datasets;
  • How RNN proves to be more effective to classify seismic facies compared to other deep learning methods (e.g., convolutional neural networks or CNN) based on results from blind tests conducted at the 2021 SEAM Hackathon.

About the presenter: Ayon K. Dey is a Senior Application Engineer at MathWorks based in Texas. Ayon’s experience spans over 30 years working in multiple Geosciences roles for oil, gas, and chemical companies based in Canada, USA, Saudi Arabia, South Africa, and UAE, as well as for research institutions in Canada and The Netherlands. Ayon has 17 expanded abstract and journal publications in technical geoscience; holds a Ph.D. degree in Applied Physics from Delft University of Technology, an M.Sc. degree in Geology and Geophysics from University of Calgary, and a B.Sc. in Applied Mathematics from Memorial University of Newfoundland.

Introduction to AlphaFold for 3D Protein Structure Prediction

Tuesday, May 21 1:00 PM - 3:30 PM

Instructors: Dr. Michael Dickens, Texas A&M High Performance Research Computing and Dr. Devon Boland, Texas A&M Institute for Genome Sciences & Society

In this workshop, participants will learn how to run AlphaFold and about the fundamentals of AlphaFold as applied to real world applications.

Presentation Slides: (pdf)

Prerequisites: ACCESS ID

Topics will include:
  • Introduction to AlphaFold
  • Running AlphaFold on Grace
  • AlphaFold Workflows
  • Visualizing AlphaFold results

Fundamentals of Accelerated Computing with CUDA C/C++, NVIDIA DLI Workshop

Friday, May 24 8:00AM-5:00PM

Instructor: Jian Tao, PhD, Assistant Professor, School of Performance, Visualization & Fine Arts, Texas A&M University.

This workshop teaches the fundamental tools and techniques for accelerating C/C++ applications to run on massively parallel GPUs with CUDA®. You’ll learn how to write code, configure code parallelization with CUDA, optimize memory migration between the CPU and GPU accelerator, and implement the workflow that you’ve learned on a new task—accelerating a fully functional, but CPU-only, particle simulator for observable massive performance gains. At the end of the workshop, you’ll have access to additional resources to create new GPU-accelerated applications on your own.

Learning Objectives:
At the conclusion of the workshop, you’ll have an understanding of the fundamental tools and techniques for GPU-accelerating C/C++ applications with CUDA and be able to:
  • Write code to be executed by a GPU accelerator
  • Expose and express data and instruction-level parallelism in C/C++ applications using CUDA
  • Utilize CUDA-managed memory and optimize memory migration using asynchronous prefetching
  • Leverage command line and visual profilers to guide your work
  • Utilize concurrent streams for instruction-level parallelism
  • Write GPU-accelerated CUDA C/C++ applications, or refactor existing CPU-only applications, using a profile-driven approach
Why Deep Learning Institute Hands-On Training?
  • Learn to build deep learning and accelerated computing applications for industries such as autonomous vehicles, finance, game development, healthcare, robotics, and more.
  • Obtain hands-on experience with the most widely used, industry-standard software, tools, and frameworks.
  • Gain real-world expertise through content designed in collaboration with industry leaders such as the Children’s Hospital of Los Angeles, Mayo Clinic, and PwC.
  • Earn an NVIDIA DLI certificate to demonstrate your subject matter competency and support career growth.
  • Access content anywhere, anytime with a fully configured, GPU-accelerated workstation in the cloud.

Full Schedule (All times are CT), subject to change

Last Update: May 21, 2024

Monday, May 20 Future of AI Sessions (ILCB 224 map)
9:00 AM - 10:00 AM Check-in
10:00 AM - 10:50 AM Keynote: Roadmap and Future of AI
Bob Crovella, NVIDIA
10:50 AM - 11:00 AM Break
11:00 AM - 11:20 AM Containers for Research Focus Session
Frank Lee, IBM
11:20 AM - 11:30 AM Break
11:30 AM - 11:50 AM Lunch Sponsored by IBM
11:50 AM Panel Introductions
Dan Gillchrist, Supermicro
11:50 AM - 12:30 PM Lunch Panel on the future of AI
Moderator: Daryl Williams (IBM)
Panelists: Frank Lee (IBM Distinguished Engineer), Jim Colson (AVP Digital Health, Texas A&M Health Science Center), Kevin Nowka (Professor of Practice, Texas A&M Electrical and Computer Engineering), and Seth C. Murray (Eugene Butler Endowed Chair in Agricultural Biotechnology, Texas A&M University
12:30 PM - 12:45 PM Break
12:45 PM - 1:05 PM Research Workbench Focus Session
Frank Lee, IBM
1:05 PM - 1:15 PM Break
1:15 PM - 2:00 PM Keynote: LLMs, Studios, Governance and More
Trey Tinnell, IBM
2:00 PM Closing Remarks
Nathan Goodnight, Red Hat
Monday, May 20 Workshop (ILCB 237 map)
9:00 AM - 10:00 AM Check-in and Account Support
10:00 AM - 11:30 AM Data Explorations with R
11:30 AM - 12:30 PM Lunch
12:30 PM - 3:00 PM Data Explorations with R (cont.)
Tuesday, May 21 Workshops (ILCB 224, 233, & 237 map)
8:30 AM - 9:00 AM Check-in and Account Support
Morning Concurrent Sessions
9:00 AM - 12:00 PM Containers for HPC Reproducibility: Building, Deploying, and Optimizing (ILCB 237)
9:00 AM - 12:00 PM CUDA Programming: Basic Concepts in C and Fortran (ILCB 233)
12:00 PM - 1:00 PM Lunch and Account Support
Afternoon Concurrent Sessions
1:00 PM - 4:00 PM Charliecloud Container Training (ILCB 237)
1:00 PM - 3:30 PM Accelerate seismic facies classification using wavelet-informed deep learning (ILCB 233)
1:00 PM - 3:30 PM Introduction to AlphaFold for 3D Protein Structure Prediction (ILCB 224)
Wednesday, May 22 Keynote, Research Talks (ILCB 224 map) Banquet and Poster Session (ILSB Lobby map)
9:00 AM - 9:50 AM Check-in
9:50 AM - 10:00 AM Opening Remarks
10:00 AM - 10:30 AM Porting the PyFR Framework to Intel PVC
Prof. Freddie Witherden, Ocean Engineering, Texas A&M University
10:30 AM - 11:00 AM Leveraging Machine Learning (ML) and Advanced Manufacturing (AM) for Enhancing Thermal Energy Storage (TES) using Molten Salt Nanofluids and Phase Change Materials (PCM)
Prof. Debjyoti Banerjee, Mechanical Engineering, Texas A&M University
11:00 AM - 12:00 PM Keynote: Opportunities in High Performance Computing at NERSC and the U.S. Department of Energy
Dr. Richard Gerber, Senior Science Advisor and HPC Department Head at the National Energy Research Scientific Computing (NERSC) Center
12:00 PM - 1:00 PM Lunch Sponsored by DDN
1:00 PM - 2:00 PM Keynote: Broadening Access to AI Resources through the National AI Research Resource (NAIRR) Pilot
Katie Antypas, Director of the National Science Foundation's Office of Advanced Cyberinfrastructure
2:00 PM - 2:30 PM Computer aided drug discovery
Prof. James C. Sacchettini, Biochemistry and Biophysics, Texas A&M University
2:30 PM - 2:50 PM Break
2:50 PM - 3:20 PM TACC Roadmap and Trends in HPC
Dr. Lars Koesterke, Texas Advanced Computing Center
3:20 PM - 3:35 PM Computer Vision Research on HPRC Computing Resources
Prof. Suxia Cui, Electrical and Computer Engineering, Prairie View A&M University
4:00 PM Posters should be on display in the ILSB Lobby for Judges
5:00 PM - 7:00 PM Banquet and Poster Session
ILSB Lobby map
Thursday, May 23 Keynote, Research Talks (ILCB 224 map)
9:00 AM - 10:00 AM Check-in
10:00 AM - 10:30 AM Mathematical and computational challenges in chemical enhanced oil recovery
Prof. Prabir Daripa, Mathematics, Texas A&M University
10:30 AM - 11:00 AM Accelerated Data Science with RAPIDS
Mahsa Lotfollahi, NVIDIA Solutions Architect - AI
11:00 AM - 12:00 PM Keynote: US-RSE: Empowering Hidden Contributors Driving Science
Sandra Gesing, Executive Director of the US Research Software Engineer Association (US-RSE)
12:00 PM - 1:00 PM Lunch
1:00 PM - 2:00 PM The Perpetual Stew: A Humanist Perspective on the Modern HPC Datacenter
Arianna Martin, HPC Platform Engineer, bp Center for HPC, NAG HPC Services
2:00 PM - 2:30 PM Computational studies in the design of materials for applications in drug delivery and environmental remediation
Prof. Phanourios Tamamis, Chemical Engineering, Texas A&M University
2:30 PM - 2:50 PM Break
2:50 PM - 3:20 PM Why Stratified Computing is Important
Prof. Paul S. Prueitt, Mathematics, Diné College
3:20 PM - 5:00 PM Researcher Lightning Talks
Friday May 24 NVIDIA DLI Workshop (ILSB Auditorium (1105) map)
8:00 AM - 12:00 PM Fundamentals of Accelerated Computing with CUDA C/C++
12:00 PM - 1:00 PM Lunch Sponsored by NVIDIA
1:00 PM - 5:00 PM Fundamentals of Accelerated Computing with CUDA C/C++ (Cont.)

Posters


Thanks to our Sponsors!

Symposium Sponsor Logos

Texas A&M Research Computing Symposium is supported in part by NSF award #1925764, CC* Team: SWEETER -- SouthWest Expertise in Expanding, Training, Education and Research and NSF award #2112356, ACSS: ACES - Accelerating Computing for Emerging Sciences.