# WLCG Site Monitoring Migration MONIT team - 02.07.2020 --- ## Recap: Report visible changes * Only two outputs: * PDF and JSON * Federation availability accounts all the sites * Even if one site is showing 0% available * No data/Unknown doesn't account for availability * If a site can't be monitored its availability will decrease --- ## Recap: New tools * [New website](http://monit-wlcg-sitemon.web.cern.ch/monit-wlcg-sitemon/) * [New recomputation mechanism](https://gitlab.cern.ch/monitoring/site-monitoring-recomputations) (Access granted on demand) * [Historical Profiles](https://monit-grafana.cern.ch/d/000000619/wlcg-sitemon-historical-profiles?orgId=20) * Provide status change and availability stats * [Historical tests](https://monit-grafana.cern.ch/d/m7XtZsEZk4/wlcg-sitemon-historical-tests?orgId=20) * Provides all tests and access to the "logs" (test details) * [Latest tests](https://monit-grafana.cern.ch/d/A_1kxGmMz/wlcg-sitemon-latest-tests?orgId=20) * Provides the last test for a given endpoint-metric --- ## Status update * Initial plan already presented two months ago * [Presentation](https://indico.cern.ch/event/915551/contributions/3849550/attachments/2034111/3405252/WLCG_Site_Monitoring_Migration.pdf) * Already covered sending May reports from both infrastructure * Experiments representatives were met * Reports already sent to site managers * Started working on implementing users feedback --- ## Work done/ongoing * :white_check_mark: Fixed several UI "bugs" * Empty tables, wrong queries... * :white_check_mark: Allow VO wide recomputations * :white_check_mark: Provide availability/reliability plots * Can be used to see statistics per vo, federation, site... * :white_check_mark: Added new profiles * :large_orange_diamond: Drilldown functionality * Adapt Grafana plugin to provide it * :large_orange_diamond: Visual representation of test validity --- ## Next steps * Extend the transition phase with reports from SAM and SiteMon from 2 to 3 months * Provide site managers one extra month for validation * Numbers generated by SiteMon are already good * Based in the new agreements * Complete the implementation and deployment of all user feedback --- ## Next steps (II) * July * 03: June draft reports from **SAM3** and **SiteMon** infrastructure * 16: June final reports from **SAM3** infrastructure * August * 03: July draft reports from **SAM3** and **SiteMon** infrastructure * 17: July final reports from **SiteMon** infrastructure * 17: Stop old dashboards but keep infrastructure running * September * 31: Retire the old infrastructure (dashboards and reports) (Delay initial plan by one month) --- ## Thank You http://cern.ch/monit-support ---
{"type":"slide","slideOptions":{"theme":"cern6"}}