Issues with Jobs failing or running slowly
- Gather livy pod logs
- Gather master-job-scheduler logs
- A screenshot of the Jobs page in SDF UI
- Gather job logs from jobs page for any problematic jobs
- The output from ./scripts/collect-system-status-info.sh
Issues with Deployments, Upgrades, or Cluster Health
- Gather logs from pods that are not in Running status for all namespaces
- The cluster-configuration.yaml file
- The output from ./scripts/collect-system-status-info.sh
Issues with Elastic Search / Data Stream / Global Search
- Gather data-transformation-service pod logs
- Gather primary-api logs
- Metrics regarding the data
-- How many datasets,
-- How many compounds
-- How many measurement types
-- How many files per dataset
-- How many maps
-- Number of measurement rows
-- Number of assay results (if they use secondary map
-- The output from ./scripts/collect-system-status-info.sh
Issues with SPM
- Primary-api logs
- job-scheduler logs
Issues with Various Pages or Page Actions
- Primary-api logs
Issues with Auth/Login
- SSO pod logs
Comments
0 comments
Article is closed for comments.