High Performance Storage System

Incremental Scalability
Based on storage needs and deployment schedules, HPSS scales incrementally by adding computer, network and storage resources. A single HPSS namespace can scale from petabytes of data to exabytes of data, from millions of files to billions of files, and from a few file-creates per second to thousands of file-creates per second.
HPSS Big Picture - Metadata Storage



HPSS File Operations Per Second

Every HPSS file operation requires one or more Db2 database transactions; for example, creating an HPSS file with data requires four Db2 metadata transactions. HPSS services can help you plan, provision, and configure Db2 to meet your transaction requirements.
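
As a rough illustration of the arithmetic, the sketch below converts a target file-operation rate into a Db2 transaction rate. Only the four-transactions-per-create figure comes from the text above; the function name, the dictionary, and the 2,000 creates-per-second workload are hypothetical and are not part of HPSS or Db2.

```python
# Db2 transactions needed per HPSS operation. Only the file-create figure
# comes from the text above; any other entry would be an assumption.
TRANSACTIONS_PER_OP = {"file_create_with_data": 4}


def required_db2_tps(ops_per_second: dict[str, float]) -> float:
    """Return the Db2 transactions per second implied by an operation mix."""
    return sum(rate * TRANSACTIONS_PER_OP[op] for op, rate in ops_per_second.items())


if __name__ == "__main__":
    workload = {"file_create_with_data": 2_000}  # hypothetical target rate
    print(required_db2_tps(workload))
```

A real sizing exercise would add the other operation types in the workload, whose transaction counts are not given here.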

HPSS Backed by Db2

Metadata describing the files in HPSS is managed by Db2 relational databases. Each relational database has tables and indexes, transaction log files, archived transaction log files, and database backup files. Together, the Db2 transaction log files (including archived logs) and the Db2 database backup files are often called the Db2 recovery objects.

Mandatory HPSS Redundancy

All Db2 database files must be protected by RAID storage: (1) Db2 tables and index files; (2) Db2 transaction log files; (3) Db2 log archive files; and (4) Db2 backup files. The Db2 transaction log files are further protected by configuring Db2 log mirroring, which keeps a second copy on a separate RAID device. The Db2 log archive files are also mirrored to a separate RAID device. All Db2 recovery files must additionally be protected under your existing site disaster recovery plan, which should call for the HPSS Db2 files to be stored on remote disk or remote tape. Your site's backup serves as the second copy of the Db2 database backup files. It is not acceptable to store the Db2 backups in HPSS itself.
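
The placement rules above can be expressed as a small check. This is only a sketch: the file-class names, the device names, and the `check_layout` helper are hypothetical placeholders rather than HPSS or Db2 settings. It covers only the local RAID rules, not the site backup requirement.

```python
# The four Db2 file classes that the text says must live on RAID storage.
ALL_CLASSES = {"tables_and_indexes", "transaction_logs", "archived_logs", "backups"}

# Classes that additionally need a second copy on a separate RAID device.
MIRRORED = {"transaction_logs", "archived_logs"}


def check_layout(layout):
    """Return a list of rule violations for a file-class -> RAID devices map."""
    problems = []
    for file_class in sorted(ALL_CLASSES):
        devices = layout.get(file_class, [])
        if not devices:
            problems.append(f"{file_class}: no RAID device assigned")
        elif file_class in MIRRORED and len(set(devices)) < 2:
            problems.append(f"{file_class}: mirror must be on a separate RAID device")
    return problems


if __name__ == "__main__":
    # Hypothetical layout; device names are placeholders.
    layout = {
        "tables_and_indexes": ["raid_a"],
        "transaction_logs": ["raid_a", "raid_b"],
        "archived_logs": ["raid_c", "raid_d"],
        "backups": ["raid_c"],
    }
    for problem in check_layout(layout) or ["layout satisfies the local RAID rules"]:
        print(problem)
```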

Optional HPSS Metadata Redundancy

When Db2 metadata must survive a complete chassis failure, a second local server is provisioned to host the RAID device that holds the second copy of the Db2 recovery objects discussed earlier. This separate chassis can be mounted in a separate rack to protect the HPSS metadata from a rack failure as well.

Db2 log shipping protects HPSS metadata from a wider range of failures, including a site failure, and helps reduce the downtime those failures cause. It does this by replicating the HPSS Db2 metadata to a geographically separate location.

HPSS Db2 Capacity

Here is an example of Db2 storage for 1 billion HPSS files in a single-chassis metadata configuration:
  1. High IOPS storage (at least two RAID devices)
    1. 1 TB RAID: Db2 transaction log files and half of the Db2 tables and indexes
    2. 1 TB RAID: Db2 transaction log mirror files and half of the Db2 tables and indexes
  2. High-bandwidth storage (at least two RAID devices)
    1. 4 TB RAID: Db2 archived transaction log files and half of the Db2 database backup files
    2. 4 TB RAID: Db2 archived transaction log mirror files and half of the Db2 database backup files
  3. Integrating Db2 recovery files into site backup
    1. Db2 recovery files are typically sent to tape using backup software (e.g. Bacula)
    2. 8 TB stores about seven days of Db2 recovery files
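
The figures in the example above can be added up in a few lines. The sketch below totals the local RAID capacity and derives the daily recovery-file volume implied by "8 TB for about seven days". The `scaled_total_tb` helper extrapolates linearly with file count, which is purely an assumption for illustration and not HPSS sizing guidance; all names are hypothetical.

```python
# Figures taken from the 1-billion-file example above, in TB.
EXAMPLE_FILES = 1_000_000_000
HIGH_IOPS_TB = [1, 1]        # logs, log mirrors, tables and indexes
HIGH_BANDWIDTH_TB = [4, 4]   # archived logs, their mirrors, and backups
RECOVERY_TB = 8              # capacity quoted for recovery files
RECOVERY_DAYS = 7            # approximate retention quoted for that capacity


def example_totals():
    """Return (total local RAID TB, recovery-file TB per day) for the example."""
    total_tb = sum(HIGH_IOPS_TB) + sum(HIGH_BANDWIDTH_TB)
    recovery_tb_per_day = RECOVERY_TB / RECOVERY_DAYS
    return total_tb, recovery_tb_per_day


def scaled_total_tb(file_count):
    """Linear extrapolation by file count. ASSUMPTION, not sizing guidance."""
    total_tb, _ = example_totals()
    return total_tb * file_count / EXAMPLE_FILES


if __name__ == "__main__":
    total, per_day = example_totals()
    print(f"local RAID capacity: {total} TB")
    print(f"recovery files: about {per_day:.2f} TB per day")
```

Actual recovery-file volume depends on the transaction rate and backup schedule, so the per-day figure is only what the example implies.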

SSD vs. HDD for Metadata Storage

Solid-state drives (SSDs) are cost-effective for the requirements of HPSS Db2: high IOPS (for Db2 tables, indexes, and logs), high bandwidth (for faster backup and restore times), and a low bit-error rate (for high integrity). NVMe (Non-Volatile Memory Express) is a host controller interface and storage protocol created to accelerate data transfer to and from SSDs over a computer's high-speed Peripheral Component Interconnect Express (PCIe) bus, and it can be found on modern RHEL platforms. SSDs deliver tens of thousands of write IOPS, hundreds of thousands of read IOPS, and transfer rates approaching a gigabyte per second per device. Spinning hard disk drives (HDDs) remain well suited to high-capacity data that is infrequently accessed, but they are no longer ideal for HPSS Db2 metadata.


Come meet with us!
HPSS @ ISC 2022
ISC 2022 is the event for high performance computing, machine learning, and data analytics, and will be in Frankfurt, Germany from May 29th through June 2nd, 2022. As we have done each year (pre-pandemic), we are scheduling meetings with people attending the conference. Please contact us to schedule an HPSS Single Client Briefing to meet with the IBM business and technical leaders of HPSS.

HPSS @ STS 2022 - postponed
The 4th Annual Storage Technology Showcase is in the planning phase, but HPSS expects to support the event later this year. See the showcase web site for details.

HPSS @ MSST 2022
The 37th International Conference on Massive Storage Systems and Technology will be in Santa Clara, California in May of 2022. Please contact us if you would like to meet with the IBM business and technical leaders of HPSS at Santa Clara University.

2022 HUF
The 2022 HPSS User Forum (HUF) is in the planning phase. The 2021 HUF was hosted online for six days spread across three weeks in October 2021 with no admission cost. We are planning to meet in person this year. Please check back next quarter for details. This will be a great opportunity to hear from HPSS users, collaboration developers, testers, support folks and leadership (from IBM and DOE Labs). Please contact us if you are not a customer but would like to attend.

HPSS @ SC22
The 2022 international conference for high performance computing, networking, storage and analysis will be in Dallas, TX from November 14th through 17th, 2022. As we have done each year (pre-pandemic), we are scheduling meetings with customers via IBM Single Client Briefings. Please contact your local IBM client executive or contact us to schedule an HPSS Single Client Briefing to meet with the IBM business and technical leaders of HPSS.

What's New?
HPSS 9.3 Release - HPSS 9.3 was released on December 14th, 2021 and introduces eight new features and numerous minor updates.

HUF 2021 - The HPSS User Forum was hosted virtually at no cost in October 2021.

DOE Announces HPSS Milestone - Todd Heer, Deputy Program Lead, Advanced Simulation and Computing (ASC) Facilities, Operations, and User Support (FOUS), announced that DOE High Performance Storage Systems (HPSS) have eclipsed one exabyte in stored data.

Atos Press Release - Atos boosts Météo-France’s data storage capacity to over 1 exabyte in 2025 to improve numerical modeling and climate predictions.

HPSS 9.2 Release - HPSS 9.2 was released on May 11th, 2021 and introduces eight new features and numerous minor updates.

Capacity Leader - ECMWF (European Centre for Medium-Range Weather Forecasts) has a single HPSS namespace with over 650 PB spanning over 439 million files.

File-Count Leader - LLNL (Lawrence Livermore National Laboratory) has a single HPSS namespace with over 70 PB spanning 1.638 billion files.

Copyright 1992 - 2021, HPSS Collaboration. All Rights Reserved.