# <img src="https://codimd.web.cern.ch/uploads/upload_45a14e417e9a8ade007f06e7b9420356.png" style="border: none;background: none;box-shadow:none"> initial deployments
[Julien Leduc](mailto:julien.leduc@cern.ch)
---
## Data archiving at CERN
<ul>
<li class="fragment">Ad aeternum storage</li>
<li class="fragment">7 tape libraries, 83 tape drives, 20k tapes</li>
<li class="fragment">Current use: <b style="color:dodgerblue;">330 PB</b></li>
<li class="fragment">Current capacity: <b style="color:coral;">0.7 EB</b></li>
<li class="fragment"><b style="color:red;">Exponentially growing</b></li>
</ul>
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_95716d3602c009e301c880b0afd4225a.png" data-background-size="80%" -->
---
<h2>Data Archiving at CERN <span class="fragment"><i style="color:blue;">Evolution</i></span></h2>
<ul>
<li class="fragment">EOS + tapes...</li>
<ul>
<li class="fragment">EOS is CERN strategic storage platform</li>
<li class="fragment">tape is the strategic long term archive medium</li>
</ul>
<li class="fragment">EOS + tapes = <span class="fragment" style="color:red;">♥</span></li>
<ul>
<li class="fragment">Meet CTA: CERN Tape Archive</li>
<li class="fragment">Streamline data paths, software and infrastructure</li>
</ul>
</ul>
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Deployment</i></span></h2>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_d361eb4b4ad42029bd3d998a1600cfa0.png" data-background-size="70%" -->
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_d2d164112f95cfd9fa22d4532281323e.png" data-background-size="70%" -->
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_7d8fb723c75a802eb77a6e53037afe26.png" data-background-size="70%" -->
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Architecture</i></span></h2>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_eac32c76dde5a45191434a90d54a4d5a.png" data-background-size="70%" -->
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Timeline</i></span></h2>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_0ae96233cb49710754263e2d780a20b6.svg" data-background-size="100%" -->
---
<h2>EOS+CTA <span class="fragment"><i style="color:blue;">Dev&oper</i></span></h2>
<p class="fragment">
Tightly coupled software <span class="fragment">⇒ <span style="color:red;">tightly coupled developments</span></span>
</p>
<p class="fragment">
<span class="fragment highlight-blue">Extensive and systematic testing is paramount to limit regressions<span>
</p>
<p class="fragment">
<span class="fragment highlight-blue">Extensive monitoring</span> in place to <span class="fragment highlight-blue">ease debugging</span> and <span class="fragment highlight-red">target high performance from day 1</span><span>
</p>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_0e38a1afc20ff3b7ce635b01826a4b84.png" data-background-size="70%" -->
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_c7b3e43b9d263e3d3c125f604b04801a.png" data-background-size="70%" -->
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_1372053cb920f7ea8bb00ca001ab2f88.png" data-background-size="70%" -->
---
## CTA metadata stress tests
**Small EOS instance VTL backed**
<span><!-- .element: class="fragment" data-fragment-index="1" -->- Few retired 1Gb/s disk servers and a decent MGM for <span style="color: dodgerblue">EOS</span></span>
<span><!-- .element: class="fragment" data-fragment-index="2" -->- A few VMs for the <span style="color: dodgerblue">CTA tape part</span></span>
<span><!-- .element: class="fragment" data-fragment-index="3" --><b>Each VTL tape server exposes 20 virtual tape drives and can sustain <span style="color: crimson">>1kHz 1kB files</span></b></span>
----
## CTA metadata stress tests <span class="fragment" data-fragment-index="1"><i style="color:blue;">Stage in</i></span>
<span><!-- .element: class="fragment" data-fragment-index="2" -->Tests until CTA routinely archived <span style="color: dodgerblue">1M files</span></span>
<span><!-- .element: class="fragment" data-fragment-index="3" -->- <b><span style="color: crimson">10M file test</span></b>:</span>
<span><!-- .element: class="fragment" data-fragment-index="4" --><img src="https://codimd.web.cern.ch/uploads/upload_ab41de6dff6da58bff6df00f1e51e723.png" class="plain" width=80%></span>
----
## CTA metadata stress tests <span class="fragment" data-fragment-index="1"><i style="color:blue;">Stage out</i></span>
<span><!-- .element: class="fragment" data-fragment-index="2" -->Tests until CTA routinely retrieved <span style="color: dodgerblue">1M files</span></span>
<span><!-- .element: class="fragment" data-fragment-index="3" -->- <b><span style="color: crimson">4M file test</span></b>:</span>
<span><!-- .element: class="fragment" data-fragment-index="4" --><img src="https://codimd.web.cern.ch/uploads/upload_7a8ae04296e39c5d958f6afda70f3ec5.png" class="plain" width=80%></span>
----
## CTA metadata stress tests <span class="fragment" data-fragment-index="1"><i style="color:blue;">CI</i></span>
<span><!-- .element: class="fragment" data-fragment-index="4" --><img src="https://codimd.web.cern.ch/uploads/upload_0ee146a25b9fe3d26c47515d41ec0477.png" class="plain" width=80%></span>
---
## CTA HI Data challenge
**Bigger EOS instance**
<span><!-- .element: class="fragment" data-fragment-index="1" -->- 10x10Gb/s disk servers and a decent MGM for <span style="color: dodgerblue">EOS</span> (~2.4PB)</span>
<span><!-- .element: class="fragment" data-fragment-index="2" -->- 14xT10KD tape drives for the <span style="color: dodgerblue">CTA tape part</span> (3.5GB/s)</span>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_62abf6f4de8bf484d242c443d22d0f45.png" data-background-size="80%" -->
----
## CTA HI Data challenge <i style="color:blue;">Extended</i>
<h2><span style="color:crimson;">Can we saturate the tape drive speed longer?</span></h2>
----
<!-- .slide: data-background="https://codimd.web.cern.ch/uploads/upload_aefab20fca3dc284c3a8402d23f29d8f.png" data-background-size="80%" -->
---
# <span style="color: dodgerblue">CTA</span> VS <span style="color: crimson">experiment data transfers</span>
----
## ATLAS stage in
Several tests conducted with Atlas DDM team using Rucio and FTS.
- 2 stage in tests of 200TB each
- ~90k files of 2.6GB archived to tape
- sub-optimal EOS instance (2 slow disk servers)
----
## ATLAS stage in
<img src="https://codimd.web.cern.ch/uploads/upload_dfa6cf2e22f47bff0ff9f705a6fbe419.png" class="plain"></span>
<img src="https://codimd.web.cern.ch/uploads/upload_8d18a04f89dfd4626a3c073a48f6717e.png" class="plain"></span>
----
## ATLAS stage out
aka *Tape carousel* test took place during Hepix:
- 3 x EOS disk servers (~3x260TB of raw JBOD space)
- 6-10 x T10KD tape drives
- 90k files retrieved from EOSCTAATLASPPS (tape) to EOSATLAS by rucio through FTS
----
## ATLAS stage out
<img src="https://codimd.web.cern.ch/uploads/upload_cdff0f357f4522aabad54db96a12de84.png" class="plain"></span>
----
## ATLAS stage out
<img src="https://codimd.web.cern.ch/uploads/upload_f08082d31f8d0839404ca282d05d7fa7.png" class="plain"></span>
----
## ATLAS stage out DDM
<img src="https://codimd.web.cern.ch/uploads/upload_5a6394a3c1efa419f01d3c548edbb60e.png" class="plain"></span>
---
# <span style="color: dodgerblue">CTA</span> & <span style="color: crimson">LHC experiments</span>
----
## <span style="color:crimson">ATLAS</span>
- PPS instance is <span style="color:blue">up and running</span>
- Will push a redundant 5% share of production into CTA
----
## <span style="color:crimson">CMS</span>
- PPS instance <span style="color:blue">is up and running</span>
- Starting work with the Rucio team
- no PhEDEx
----
## <span style="color:crimson">ALICE</span>
- ALICE and CERN IT already <span style="color:blue">moved CASTOR behind EOS</span>
- During HI production DAQ went to EOSALICE first
- Agreed to provide a CTA PPS instance
----
## <span style="color:crimson">LHCb</span>
- relies heavily on SRM
- CTA does not provide SRM
- SRMless transfers hiden behing GFAL2
- Agreed to write DAQ data to EOSLHCb instance first
---
<!-- .slide: data-background="https://media.giphy.com/media/26BRqPg05olzXG1bi/giphy.gif" data-background-size="70%" -->
---
<img src="https://codimd.web.cern.ch/uploads/upload_e393b27c3245bb4b8d8e327488433109.png" class="plain"></span>
{"title":"EOSWS2019 CTA initial deployments","description":"EOS workshop 2019 presentation of CTA initial deployment","slideOptions":{"transition":"slide","theme":"white"}}