BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//CERN//INDICO//EN
BEGIN:VEVENT
SUMMARY:Joint NHR Data Management Training
DTSTART:20241105T074500Z
DTEND:20241106T163000Z
DTSTAMP:20260614T093400Z
UID:indico-event-898@events.gwdg.de
DESCRIPTION:Speakers: Hendrik Nolte (GWDG)\n\nThis is a joint NHR training
  held by five NHR centers. It consists of different sessions rela
 ted to a diverse set of challenges arising when doing proper data manageme
 nt within HPC workloads. Although the different sessions build on each
  other\, they can still be taken individually. However\, to efficientl
 y participate in selected sessions\, participants should have a
  reasonable familiarity with previously taught concepts. The entire cours
 e will take place online and span two days.\nThis co
 urse will start with a basic introduction to data management on HPC system
 s and their specific challenges. This covers the concept of storage tieri
 ng\, and how HPC workflows can be designed to make optimal use of it. Impo
 rtant permission concepts to efficiently organize larger consortia and iso
 late different users within their own\, well-defined space along with furt
 her techniques for data sharing and data cataloging are also explained. Al
 l of these concepts are supplemented by hands-on sessions. \nThen\, furth
 er details on metadata and their extraction are given\, followed by the in
 troduction of dedicated data management systems\, with a specific focus on
  Coscine.\nThe second day starts with a deep dive into the Research Data M
 anagement Organizer (RDMO)\, a well-established tool for creating Data Man
 agement Plans (DMPs).\nThe course concludes with a detailed and holistic
  overview of storage systems. It starts by explaining the meaning of
  I/O\, inodes\, and files. Differences between local filesystems (like ex
 t4) and parallel filesystems (like BeeGFS or Lustre) and their implication
 s are stated. Then different access patterns for parallel I/O are introduc
 ed\, and tools like Darshan and Score-P for analysis are demonstrated.
  This session concludes with a summary of I/O best practices.\nEveryone c
 an join this course for free thanks to the funding received from the “Na
 tionales Hochleistungsrechnen” by the project “Large Scale Data Manage
 ment”.\n\nhttps://events.gwdg.de/event/898/
LOCATION:BigBlueButton (Online)
URL:https://events.gwdg.de/event/898/
END:VEVENT
END:VCALENDAR
