# CTA status and plans
27 September 2019
[Julien Leduc](mailto:julien.leduc@cern.ch) for the CTA team
---
## Data archiving at CERN
<ul>
<li class="fragment">Ad aeternum storage</li>
<li class="fragment">Current use: <b style="color:dodgerblue;">340 PB</b></li>
<li class="fragment"><b style="color:red;">Exponentially growing</b></li>
<li class="fragment">Run2: 7 tape libraries, 83 tape drives, 30k tapes</li>
<li class="fragment">Run3: 4-5 tape libraries, 160+ tape drives, 150PB+/year, >40GB/s</li>
</ul>
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_95716d3602c009e301c880b0afd4225a.png" data-background-size="80%" -->
---
<h2>Data Archiving at CERN <span class="fragment"><i style="color:blue;">Evolution</i></span></h2>
<ul>
<li class="fragment">EOS + tapes...</li>
<ul>
<li class="fragment">EOS is CERN strategic storage platform</li>
<li class="fragment">tape is the strategic long term archive medium</li>
</ul>
<li class="fragment">EOS + tapes = <span class="fragment" style="color:red;">♥</span></li>
<ul>
<li class="fragment">Meet CTA: CERN Tape Archive</li>
<li class="fragment">Streamline data paths, consolidate software development and operations</li>
</ul>
</ul>
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Deployment</i></span></h2>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_d11477e440602657feb6144ca74b97b8.svg" data-background-size="70%" -->
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_38e9df2ac5b0bab55677c1ba22b045cd.svg" data-background-size="70%" -->
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Timeline</i></span></h2>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_0ae96233cb49710754263e2d780a20b6.svg" data-background-size="100%" -->
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Architecture</i></span></h2>
<ul>
<li class="fragment">CTA offers the <i>Best of Both Worlds</i></li>
<ul>
<li class="fragment">User interface and file access <span style="color:dodgerblue">from EOS</span></li>
<li class="fragment">Tape system management <span style="color:crimson">from CASTOR</span></li>
<li class="fragment">New scalable, robust queuing system to link the two</li>
</ul>
<li class="fragment">CTA design principles</li>
<ul>
<li class="fragment">Simplicity</li>
<li class="fragment">Scalability</li>
<li class="fragment">Performance</li>
</ul>
</ul>
----
<h2>EOS+CTA <i style="color:blue;">Architecture</i></h2>
Main difference from CASTOR: <span class="fragment" style="color: dodgerblue"><b>EOSCTA is a pure tape system.</b></span>
<span class="fragment">Disk cache duties are consolidated in the main <b style="color: dodgerblue">EOS instance.</b></span>
<span class="fragment">Efficiently operating tape drives at full speed, full time, requires an <b style="color: crimson">SSD-based buffer.</b></span>
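A rough back-of-the-envelope check (using the per-drive speed and drive counts quoted elsewhere in this talk) shows why the buffer must sustain a high aggregate rate:

```python
# Rough estimate of aggregate tape bandwidth for Run3, using figures
# from this talk: ~360 MB/s per drive, 160+ drives, >40 GB/s target.
drive_speed_mb_s = 360          # nominal per-drive streaming speed
drives = 160                    # Run3 target drive count
aggregate_gb_s = drive_speed_mb_s * drives / 1000
print(f"{aggregate_gb_s:.1f} GB/s aggregate")  # comfortably above 40 GB/s
```

Keeping that many drives streaming simultaneously is what rules out spinning disks for the buffer.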
----
<h2>EOS+CTA <i style="color:blue;">Architecture</i></h2>
<img src="https://codimd.web.cern.ch/uploads/upload_e764d94a4ee3ac79c328ea0d21a6a128.svg" class="plain">
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Typical operations</i></span></h2>
<ul>
<li class="fragment">Write file to eoscta buffer</li>
<li class="fragment">Is file on tape?</li>
<li class="fragment">Queue file for retrieve</li>
<li class="fragment">Is file in eoscta buffer?</li>
<li class="fragment">Read file from eoscta buffer</li>
<li class="fragment">Evict file from eoscta buffer</li>
<li class="fragment">Delete file from namespace</li>
</ul>
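The operations above can be sketched as a toy state model. This is purely illustrative (class and method names are invented, and the archive queue is collapsed into a single step); real CTA queues requests and drives tape hardware asynchronously:

```python
# Toy model of the eoscta buffer lifecycle listed above.
# Illustrative only: not CTA code, names are invented.

class EosctaBuffer:
    def __init__(self):
        self.buffer = set()     # files currently on the SSD buffer
        self.tape = set()       # files safely stored on tape
        self.namespace = set()  # files known to the namespace

    def write(self, f):
        """Write a file into the eoscta buffer; it is queued for tape.
        For simplicity we assume the archive queue flushes immediately."""
        self.buffer.add(f)
        self.namespace.add(f)
        self.tape.add(f)

    def retrieve(self, f):
        """Queue a retrieve: stage the file from tape back into the buffer."""
        if f in self.tape:
            self.buffer.add(f)

    def read(self, f):
        """A file is readable only while it sits in the buffer."""
        return f in self.buffer

    def evict(self, f):
        """Free buffer space; the tape copy remains."""
        self.buffer.discard(f)

    def delete(self, f):
        """Remove the file from the namespace, buffer and tape."""
        self.namespace.discard(f)
        self.buffer.discard(f)
        self.tape.discard(f)
```

The key property the model captures: after an evict the file is no longer readable, but a later retrieve brings it back from tape.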
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Dev &amp; ops</i></span></h2>
<p class="fragment">
Tightly coupled software <span class="fragment">⇒ <span style="color:red;">tightly coupled developments</span></span>
</p>
<p class="fragment">
<span class="fragment highlight-blue">Extensive and systematic testing is paramount to limit regressions</span>
</p>
<p class="fragment">
<span class="fragment highlight-blue">Extensive monitoring</span> in place to <span class="fragment highlight-blue">ease debugging</span> and <span class="fragment highlight-red">target high performance from day 1</span>
</p>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_93d85ab12e6b09b311778d3d762d9185.png" data-background-size="70%" -->
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Pre-production instances</i></span></h2>
----
## <span style="color: dodgerblue">EOSCTA</span>PPS
5 hyperconverged servers:
- 16x1TB SSDs, 10Gb/s each
- hosting **5 EOSCTA instances**

Bandwidth-oriented EOS setup; each server runs:
- **1 EOS MGM**
- **1 EOS NAMESPACE** - *quarkdb*
- **14 EOS DISKSERVERs** - *FSTs*
----
## <span style="color: dodgerblue">EOSCTA</span>PPS
<img src="https://codimd.web.cern.ch/uploads/upload_88f3b03ec6cf37aa8c59787b8909d6f6.svg" class="plain">
----
## <span style="color: dodgerblue">EOSCTA</span>PPS
Deployed instances:
<ul>
<li class="fragment"><b>eosctaatlaspps</b> receives a redundant share of CASTOR writes (rule in place)</li>
<li class="fragment"><b>eosctacmspps</b> tape endpoint for CMS Rucio instance</li>
<li class="fragment"><b>eosctaalicepps</b> for ALICE</li>
<li class="fragment"><b>eosctapps</b> CASTOR migration instance</li>
<li class="fragment"><b>eosctarepack</b> for CTA repack activities</li>
</ul>
---
## <span style="color: dodgerblue">CTA</span> and <span style="color: crimson">ATLAS Data Carousel</span>
Use tape as input for I/O-intensive workflows.
<img src="https://codimd.web.cern.ch/uploads/upload_b121d6147230892c509c45a2af072320.png" class="plain">
----
## <span style="color: dodgerblue">CTA</span> and <span style="color: crimson">ATLAS Data Carousel</span>
Close collaboration between:
- the ATLAS DDM team
- Rucio developers
- FTS, CTA, XRootD, EOS developers

has been key to the ATLAS workflow integration work.
<span class="fragment" style="color:dodgerblue"><b>MANY THANKS!</b></span>
----
## ATLAS Archival (April 2019)
<img src="https://codimd.web.cern.ch/uploads/upload_9f8be3fca81b9dcdca64e7e87c5befed.png" class="plain">
----
## ATLAS Recall (June 2019)
<img src="https://codimd.web.cern.ch/uploads/upload_b4dd1881c1c1be7126489c41b34edce9.png" class="plain">
----
## ATLAS Recall (June 2019): inefficiencies
<img src="https://codimd.web.cern.ch/uploads/upload_54d5eb635a3fbfca6294fcabec0545ab.png" class="plain">
---
## CASTOR -> CTA migration
ATLAS needed `/castor/cern.ch/grid/atlas/rucio` for the next recall exercise.
Migration principles:
- metadata only operation
- CASTOR data is **RO in CTA**
- migration **by tape pool**
90M files were migrated from CASTOR to CTA
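A minimal sketch of what "metadata-only, by tape pool" means in practice (record layout and function name are invented for illustration; the real migration operates on the CASTOR and CTA catalogue databases):

```python
# Illustrative sketch of one metadata-only migration step.
# Invented names; no file data moves, only catalogue entries.

def migrate_tape_pool(castor_files, pool):
    """Import catalogue entries for one tape pool into CTA, read-only.

    Mirrors the migration principles above: metadata-only, performed
    pool by pool, and imported CASTOR data is flagged RO in CTA.
    """
    migrated = []
    for f in castor_files:
        if f["tape_pool"] != pool:   # migration proceeds one pool at a time
            continue
        migrated.append({
            "path": f["path"],
            "tape_pool": pool,
            "read_only": True,       # CASTOR data is RO in CTA
        })
    return migrated
```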
---
## ATLAS 2018 recall campaign
CTA migration instance `eosctapps`:
- 5 hyperconverged servers
- 20TB of SSDs
- 10 tape drives
----
## CTA cumulated recall volume
<img src="https://codimd.web.cern.ch/uploads/upload_56d9d3f7ba73f6b59dc8b4616e1a2c4f.png" class="plain">
----
## CTA share in daily total recall volume
<img src="https://codimd.web.cern.ch/uploads/upload_a3e2e74e123fd54914513bba10fe5c22.png" class="plain">
----
## CTA daily recall volume
<img src="https://codimd.web.cern.ch/uploads/upload_c24024734407c1aada74d3b1be28ae93.png" class="plain">
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_aa47013dfc372c182dd605d1f825f170.png" data-background-size="100%" -->
----
> From ATLAS side, the recall campaign went smoothly and we managed to recall all the files (326k files, 741 TB) in a timely manner (CERN was the first site to achieve the full recall of all the files).
> This gave us additional confidence in CTA performance and in the migration strategy.
>
> ATLAS DDM
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Production instances</i></span></h2>
Getting ready for the final migration:
- final buffer hardware is on its way:
  - racks are cabled
  - network switches allocated (600Gb/s of bandwidth)
  - 30 hyperconverged machines by mid-October
- production ready at the beginning of November
----
```graphviz
graph hierarchy {
nodesep=1 // increases the separation between nodes
node [color=Red, fontname=Courier, shape=box] // All nodes will have this shape and colour
edge [color=Blue, label="25Gb/s"] //All the lines look like this
Router [shape=circle]
Router--{SwitchBuffer} [label="3x(2x100Gb/s)", fontsize=15, style=bold]
Router--{SwitchTape} [label="7x20Gb/s", fontsize=15, style=bold]
subgraph cluster_level1{
label="EOSCTA Buffer infrastructure\n3x10 hyperconverged servers"
color=dodgerblue
fontcolor=dodgerblue
SwitchBuffer
SSD01 [color=black, shape=cylinder]
SSDXX [color=black, shape=cylinder]
SSD16 [color=black, shape=cylinder]
buffersrv01
buffersrvXX--{SSD01 SSDXX SSD16} [label=""]
}
subgraph cluster_level2{
label="Tape infrastructure\nXX tapeservers"
color=crimson
fontcolor=crimson
SwitchTape
SwitchTape--{tpsrv01 tpsrvXX} [color=Blue, label="10Gb/s"]
SwitchBuffer--{buffersrv01 buffersrvXX } [color=Blue, label="25Gb/s", style=bold]
{rank=same; tpsrv01 tpsrvXX} // Put them on the same level
tape [color=black, shape=Msquare]
tpsrvXX--tape [label="360MB/s"]
}
}
```
---
<h2>Status summary</h2>
<ul>
<li class="fragment">Core developments finished</li>
<li class="fragment">Workflow integration in FTS and Rucio (through the XRootD API)</li>
<li class="fragment">Core operational environment ready</li>
<li class="fragment">Extensive internal testing and external validation</li>
<li class="fragment">Outside institutes have expressed interest and are collaborating</li>
</ul>
<b class="fragment" style="color:dodgerblue;">WE ARE READY!</b>
---
## Extra slides
---
# 2018 recall exercise performance monitoring
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_aa47013dfc372c182dd605d1f825f170.png" data-background-size="100%" -->
---
# Workflows for Archival and Retrieval
----
## Archival
```mermaid
sequenceDiagram
participant Experiment
participant FTS
participant EOS
participant EOSCTA
participant Tape
Experiment->>FTS: archive(file)
activate EOS
FTS->>EOSCTA: xrdcp EOS:file
EOS->>+EOSCTA: file
loop until timeout
FTS->>EOSCTA: file backup_bit ?
alt backup_bit=1
EOSCTA-->>+Tape: file
deactivate EOSCTA
Tape->>FTS: file on tape
FTS->>Experiment: file archival OK
deactivate Tape
else backup_bit=0
activate EOSCTA
EOSCTA-xFTS: file NOT on tape
FTS->>-EOSCTA: delete file
FTS-xExperiment: file archival FAILED
end
end
deactivate EOS
```
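The FTS polling loop in the diagram can be sketched as follows (a simplified model: `check_backup_bit` stands in for the real FTS query of the file's backup bit against EOSCTA, and the on-failure delete is left to the caller):

```python
import time

def wait_for_archive(check_backup_bit, timeout_s, poll_s=0.01):
    """Poll the backup bit until the file is on tape or we time out.

    check_backup_bit: callable returning True once the file is on tape.
    Mirrors the loop in the diagram: success -> 'file archival OK',
    timeout -> 'file archival FAILED' (FTS then deletes the buffer copy).
    """
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        if check_backup_bit():
            return "file archival OK"
        time.sleep(poll_s)           # back off between polls
    return "file archival FAILED"
```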
----
## Retrieval
```mermaid
sequenceDiagram
participant Experiment
participant FTS
participant EOS
participant EOSCTA
participant Tape
Experiment->>FTS: retrieve(file)
activate Tape
FTS->>EOSCTA: xrdfs prepare -s file
loop until timeout
FTS->>EOSCTA: file online ?
alt online_bit=1
Tape->>+EOSCTA: file
activate EOSCTA
EOSCTA->>FTS: file is online
FTS->>EOS: xrdcp EOSCTA:file
EOSCTA->>+EOS: file
FTS->>EOSCTA: xrdfs prepare -e
deactivate EOSCTA
FTS->>Experiment: file retrieval OK
deactivate EOS
else online_bit=0
EOSCTA-xFTS: file is NOT online
FTS-xExperiment: file retrieval FAILED
end
end
deactivate Tape
```
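The retrieval sequence can be sketched the same way (a simplified model: the four callables stand in for `xrdfs prepare -s`, the online-bit query, `xrdcp` and `xrdfs prepare -e` from the diagram):

```python
import time

def retrieve_file(stage, is_online, copy_out, evict, timeout_s, poll_s=0.01):
    """Sketch of the retrieval sequence: stage, poll, copy, evict.

    stage:     queue the recall from tape (xrdfs prepare -s)
    is_online: callable, True once the file is in the SSD buffer
    copy_out:  copy the file to the main EOS instance (xrdcp)
    evict:     free the buffer slot afterwards (xrdfs prepare -e)
    """
    stage()
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        if is_online():              # file landed in the eoscta buffer
            copy_out()
            evict()                  # buffer space is scarce: evict promptly
            return "file retrieval OK"
        time.sleep(poll_s)
    return "file retrieval FAILED"
```

Note the eviction step: the buffer is a small SSD staging area, not a cache, so files are evicted as soon as they have been copied out.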
---
{"title":"190927 ITTF/Computing Seminar","description":"CERN Tape Archive production status and plans","slideOptions":{"transition":"slide","theme":"white"}}