Page MenuHomePhabricator

Jclark-ctr (John Clark)
User

Projects

User does not belong to any projects.

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Wednesday

  • Clear sailing ahead.

User Details

User Since
Jul 24 2019, 8:11 PM (285 w, 4 d)
Availability
Available
LDAP User
Jclark-ctr
MediaWiki User
Jclark-ctr [ Global Accounts ]

Recent Activity

Thu, Jan 9

Jclark-ctr closed T383051: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1243.eqiad.wmnet, a subtask of T377876: Migrate wikikube-eqiad to containerd, as Resolved.
Thu, Jan 9, 4:33 PM · collaboration-services, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr closed T383051: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1243.eqiad.wmnet as Resolved.

Reimaged passed with no issues

Thu, Jan 9, 4:33 PM · SRE, DC-Ops, ops-eqiad, collaboration-services, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr closed T382535: PDU sensor over limit as Resolved.
Thu, Jan 9, 4:32 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr closed T381770: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1069.eqiad.wmnet as Resolved.

flea power drain and Reimaged server

Thu, Jan 9, 4:21 PM · SRE, ops-eqiad, DC-Ops, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr closed T381770: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1069.eqiad.wmnet, a subtask of T377876: Migrate wikikube-eqiad to containerd, as Resolved.
Thu, Jan 9, 4:21 PM · collaboration-services, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr added a comment to T382535: PDU sensor over limit.

Rebalanced AA breaker and BB breaker

Thu, Jan 9, 4:10 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr closed T381676: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1057.eqiad.wmnet, a subtask of T377876: Migrate wikikube-eqiad to containerd, as Resolved.
Thu, Jan 9, 3:56 PM · collaboration-services, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr closed T381676: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1057.eqiad.wmnet as Resolved.

Reimaged server without issues. it was posted onto T381789 ticket by mistake

Thu, Jan 9, 3:56 PM · serviceops, SRE, ops-eqiad, DC-Ops
Jclark-ctr closed T381789: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1073.eqiad.wmnet, a subtask of T377876: Migrate wikikube-eqiad to containerd, as Resolved.
Thu, Jan 9, 3:40 PM · collaboration-services, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr closed T381789: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1073.eqiad.wmnet as Resolved.

Reimaged passed with no issues

Thu, Jan 9, 3:40 PM · SRE, ops-eqiad, DC-Ops, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr closed T381878: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1081.eqiad.wmnet, a subtask of T377876: Migrate wikikube-eqiad to containerd, as Resolved.
Thu, Jan 9, 2:47 PM · collaboration-services, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr closed T381878: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1081.eqiad.wmnet as Resolved.
Thu, Jan 9, 2:47 PM · SRE, DC-Ops, ops-eqiad, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr added a comment to T381878: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1081.eqiad.wmnet.

@Jelto i am going to start flea power draining them and reimaging them wanted to try to resolve 1 at a time

Thu, Jan 9, 2:47 PM · SRE, DC-Ops, ops-eqiad, Prod-Kubernetes, Kubernetes, serviceops

Wed, Jan 8

Jclark-ctr closed T380499: Q2:rack/setup/install cloudcontrol1011 as Resolved.
Wed, Jan 8, 6:18 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T380499: Q2:rack/setup/install cloudcontrol1011.
Wed, Jan 8, 6:17 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr added a comment to T380499: Q2:rack/setup/install cloudcontrol1011.
Failed to load ldlinux.c32
Boot failed: press a key to retry, or wait for reset...
..............
Wed, Jan 8, 5:40 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T380499: Q2:rack/setup/install cloudcontrol1011.
Wed, Jan 8, 5:39 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr added a comment to T381878: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1081.eqiad.wmnet.

@Jelto i performed flea power drain and looks to image properly the critical status has cleared will update dell but looks good for now

Wed, Jan 8, 4:52 PM · SRE, DC-Ops, ops-eqiad, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr added a comment to T375842: decommission mw[1349-1413].

@akosiaris i see you checked off all the boxes for the dcops team are these ready to be removed?

Wed, Jan 8, 4:43 PM · Patch-For-Review, SRE, DC-Ops, ops-eqiad, serviceops, decommission-hardware
Jclark-ctr closed T383033: decommission dbproxy1021.eqiad.wmnet, a subtask of T368874: Productionize dbproxy102[89], as Resolved.
Wed, Jan 8, 4:40 PM · Patch-For-Review, DBA
Jclark-ctr closed T383033: decommission dbproxy1021.eqiad.wmnet as Resolved.
Wed, Jan 8, 4:40 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr updated the task description for T383033: decommission dbproxy1021.eqiad.wmnet.
Wed, Jan 8, 4:39 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr closed T383025: decommission dbproxy1020.eqiad.wmnet, a subtask of T368874: Productionize dbproxy102[89], as Resolved.
Wed, Jan 8, 4:37 PM · Patch-For-Review, DBA
Jclark-ctr closed T383025: decommission dbproxy1020.eqiad.wmnet as Resolved.
Wed, Jan 8, 4:37 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr updated the task description for T383025: decommission dbproxy1020.eqiad.wmnet.
Wed, Jan 8, 4:37 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr moved T383025: decommission dbproxy1020.eqiad.wmnet from Backlog to Decommission on the ops-eqiad board.
Wed, Jan 8, 3:57 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware
Jclark-ctr moved T375842: decommission mw[1349-1413] from Backlog to Decommission on the ops-eqiad board.
Wed, Jan 8, 3:57 PM · Patch-For-Review, SRE, DC-Ops, ops-eqiad, serviceops, decommission-hardware
Jclark-ctr moved T383033: decommission dbproxy1021.eqiad.wmnet from Backlog to Decommission on the ops-eqiad board.
Wed, Jan 8, 3:56 PM · SRE, DC-Ops, ops-eqiad, DBA, decommission-hardware

Thu, Dec 19

Jclark-ctr closed T382002: PDU sensor over limit as Resolved.

rebalanced pdu for B4. L1 A

Thu, Dec 19, 7:05 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr updated the task description for T380499: Q2:rack/setup/install cloudcontrol1011.
Thu, Dec 19, 6:52 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops

Tue, Dec 17

Jclark-ctr closed T380673: Kernel error Server cloudvirt1061 may have kernel errors as Resolved.
Tue, Dec 17, 7:21 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr added a comment to T380673: Kernel error Server cloudvirt1061 may have kernel errors.

So that was my mistake i have found out from dell that it only supports 6x dimms for cpu2. 10x dimm for cpu1.

Tue, Dec 17, 7:21 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr added a comment to T382002: PDU sensor over limit.

I see an-presto1005 was just changed to decom status in netbox waiting for decom ticket to remove from rack should resolve power issue

Tue, Dec 17, 7:17 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr moved T381742: Degraded RAID on aqs1014 from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Tue, Dec 17, 7:15 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr assigned T381742: Degraded RAID on aqs1014 to VRiley-WMF.
Tue, Dec 17, 7:14 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr moved T381878: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1081.eqiad.wmnet from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Tue, Dec 17, 7:14 PM · SRE, DC-Ops, ops-eqiad, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr claimed T381878: hw troubleshooting: "Comm Error: backplane 0" for wikikube-worker1081.eqiad.wmnet.

Confirmed: Service Request 202767674 was successfully submitted.

Tue, Dec 17, 7:14 PM · SRE, DC-Ops, ops-eqiad, Prod-Kubernetes, Kubernetes, serviceops

Dec 11 2024

Jclark-ctr closed T381902: hw troubleshooting: Stuck/bugged BMC on ml-lab1002.eqiad.wmnet as Resolved.
Dec 11 2024, 10:12 PM · SRE, Machine-Learning-Team, ops-eqiad, DC-Ops
Jclark-ctr assigned T382033: Degraded RAID on aqs1014 to VRiley-WMF.

@VRiley-WMF looks like it came back T362841 same drive SDG

Dec 11 2024, 10:05 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr updated subscribers of T380673: Kernel error Server cloudvirt1061 may have kernel errors.

i have updated firmwares and dell sees no issues. these where ordered with 512 memory and are listing the correct amount I thought it was odd cloudvirt1054 - cloudvirt1061 are all unbalanced memory between the CPU1 /2 @wiki_willy was there any reason these would be unbalanced?

Dec 11 2024, 9:59 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T378368: Q2:rack/setup/install cloudelastic101[12].
Dec 11 2024, 9:11 PM · Data-Platform-SRE (2025.01.11 - 2025.01.31), SRE, ops-eqiad, Discovery-Search, DC-Ops

Dec 10 2024

Jclark-ctr added a comment to T378368: Q2:rack/setup/install cloudelastic101[12].

@elukey the 10g card is copper rj45 and not in use. AOC-ATGC-i2TM. The 10g port is connected using DAC cable to AOC-A25G-b2SM.

Dec 10 2024, 12:08 AM · Data-Platform-SRE (2025.01.11 - 2025.01.31), SRE, ops-eqiad, Discovery-Search, DC-Ops

Dec 5 2024

Jclark-ctr updated subscribers of T378368: Q2:rack/setup/install cloudelastic101[12].

@elukey Hey luca these two are failing to provision these are custom configs

Dec 5 2024, 5:06 PM · Data-Platform-SRE (2025.01.11 - 2025.01.31), SRE, ops-eqiad, Discovery-Search, DC-Ops
Jclark-ctr updated the task description for T378368: Q2:rack/setup/install cloudelastic101[12].
Dec 5 2024, 5:05 PM · Data-Platform-SRE (2025.01.11 - 2025.01.31), SRE, ops-eqiad, Discovery-Search, DC-Ops
Jclark-ctr closed T379856: Degraded RAID on an-worker1169 as Resolved.

Replaced Failed Drive

Dec 5 2024, 3:16 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr closed T378030: Q2:rack/setup/install wdqs102[567] as Resolved.
Dec 5 2024, 12:43 AM · Data-Platform-SRE (2024.11.30 - 2024.12.20), wmde-wikidata-tech, Wikidata, Wikidata-Query-Service, SRE, Discovery-Search, ops-eqiad, DC-Ops
Jclark-ctr closed T371389: Q1:rack/setup/install ms-be10{83-91} as Resolved.
Dec 5 2024, 12:38 AM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T378030: Q2:rack/setup/install wdqs102[567].
Dec 5 2024, 12:37 AM · Data-Platform-SRE (2024.11.30 - 2024.12.20), wmde-wikidata-tech, Wikidata, Wikidata-Query-Service, SRE, Discovery-Search, ops-eqiad, DC-Ops
Jclark-ctr updated subscribers of T378143: Q2:rack/setup/install es104[1-6].

running into issues with the last two @ABran-WMF es1043 is imaged but will not pass certificate for puppet es1045 will not pxe @Jhancock.wm if you get a chance can you take a look at these two see if you can tell whats missing

Dec 5 2024, 12:35 AM · Data-Persistence-Automations, DBA, SRE, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr closed T371435: Q1:eqiad:frack network upgrade tracking task as Resolved.
Dec 5 2024, 12:29 AM · SRE, fundraising-tech-ops, netops, ops-eqiad, DC-Ops, Infrastructure-Foundations
Jclark-ctr closed T376547: Inbound interface errors - msw1-eqiad.mgmt.eqiad.wmnet as Resolved.

No active allerts in librenms

Dec 5 2024, 12:28 AM · SRE, DC-Ops, ops-eqiad
Jclark-ctr closed T381230: PDU sensor over limit as Resolved.

Rebalanced Pdu

Dec 5 2024, 12:28 AM · SRE, DC-Ops, ops-eqiad
Jclark-ctr claimed T380673: Kernel error Server cloudvirt1061 may have kernel errors.

I did notice it looks like memory is missing from inventory report looks like slots b7-b10 are not reporting any memory

Dec 5 2024, 12:25 AM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr closed T380182: Inbound interface errors as Resolved.
Dec 5 2024, 12:20 AM · SRE, DC-Ops, ops-eqiad
Jclark-ctr added a comment to T380673: Kernel error Server cloudvirt1061 may have kernel errors.

Followed up with Dell. can you confirm that i can power down server again tomorrow to inspect memory @aborrero

Dec 5 2024, 12:14 AM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops

Dec 4 2024

Jclark-ctr updated the task description for T378143: Q2:rack/setup/install es104[1-6].
Dec 4 2024, 11:43 PM · Data-Persistence-Automations, DBA, SRE, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T371389: Q1:rack/setup/install ms-be10{83-91}.
Dec 4 2024, 10:12 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T378143: Q2:rack/setup/install es104[1-6].
Dec 4 2024, 1:59 PM · Data-Persistence-Automations, DBA, SRE, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T378143: Q2:rack/setup/install es104[1-6].
Dec 4 2024, 1:58 PM · Data-Persistence-Automations, DBA, SRE, Data-Persistence, ops-eqiad, DC-Ops

Dec 3 2024

Jclark-ctr closed T381283: wdqs1025 fails to PXE boot, NIC shows "no link" in DRAC web UI as Resolved.

@bking replaced cable link came up sorry for delay

Dec 3 2024, 11:42 PM · Data-Platform-SRE (2024.11.30 - 2024.12.20), wmde-wikidata-tech, Wikidata, Wikidata-Query-Service, SRE, Discovery-Search, ops-eqiad, DC-Ops
Jclark-ctr closed T381283: wdqs1025 fails to PXE boot, NIC shows "no link" in DRAC web UI, a subtask of T378030: Q2:rack/setup/install wdqs102[567], as Resolved.
Dec 3 2024, 11:36 PM · Data-Platform-SRE (2024.11.30 - 2024.12.20), wmde-wikidata-tech, Wikidata, Wikidata-Query-Service, SRE, Discovery-Search, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T371389: Q1:rack/setup/install ms-be10{83-91}.
Dec 3 2024, 11:21 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T371389: Q1:rack/setup/install ms-be10{83-91}.
Dec 3 2024, 11:12 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T371389: Q1:rack/setup/install ms-be10{83-91}.
Dec 3 2024, 10:54 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr moved T371389: Q1:rack/setup/install ms-be10{83-91} from Racking Tasks to Remote Work on the ops-eqiad board.
Dec 3 2024, 9:05 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T371389: Q1:rack/setup/install ms-be10{83-91}.
Dec 3 2024, 8:49 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T371389: Q1:rack/setup/install ms-be10{83-91}.
Dec 3 2024, 6:01 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T371389: Q1:rack/setup/install ms-be10{83-91}.
Dec 3 2024, 12:34 AM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops

Nov 27 2024

Jclark-ctr moved T380673: Kernel error Server cloudvirt1061 may have kernel errors from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Nov 27 2024, 2:25 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr added a comment to T380673: Kernel error Server cloudvirt1061 may have kernel errors.

Finished with bios update waiting on dell for response for new ticket

Nov 27 2024, 2:25 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
Jclark-ctr added a comment to T380673: Kernel error Server cloudvirt1061 may have kernel errors.

Dell rejected parts request opening new ticket with them 201666996

Nov 27 2024, 2:11 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops

Nov 26 2024

Jclark-ctr claimed T379856: Degraded RAID on an-worker1169.

Confirmed: Service Request 201596930 was successfully submitted.

Nov 26 2024, 2:12 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr added a comment to T380673: Kernel error Server cloudvirt1061 may have kernel errors.

@aborrero i have updated Idrac firmware. I assume Dell will want me to update bios firmware which will require reboot I will open up ticket with Dell requesting memory replacement

Nov 26 2024, 2:02 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops

Nov 19 2024

xcollazo awarded T380278: High priority: Disk space expansion on an-launcher1002 a Pterodactyl token.
Nov 19 2024, 8:31 PM · Data-Platform-SRE (2024.11.09 - 2024.11.29), SRE, ops-eqiad, DC-Ops
Jclark-ctr assigned T377878: Q2:rack/setup/install an-worker11[78-86] to VRiley-WMF.
Nov 19 2024, 8:29 PM · SRE, Data-Platform, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T377878: Q2:rack/setup/install an-worker11[78-86].
Nov 19 2024, 8:25 PM · SRE, Data-Platform, ops-eqiad, DC-Ops
Jclark-ctr closed T380278: High priority: Disk space expansion on an-launcher1002 as Resolved.

Installed 2 x960gb ssd into slot 2/3

Nov 19 2024, 8:23 PM · Data-Platform-SRE (2024.11.09 - 2024.11.29), SRE, ops-eqiad, DC-Ops
Jclark-ctr closed T380278: High priority: Disk space expansion on an-launcher1002, a subtask of T380222: Analytics airflow instance not showing any DAGs, as Resolved.
Nov 19 2024, 8:21 PM · Data-Platform-SRE (2024.11.09 - 2024.11.29)
Jclark-ctr closed T378185: Q2:rack/setup/install wikikube-worker13[13-28] as Resolved.
Nov 19 2024, 5:57 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T378185: Q2:rack/setup/install wikikube-worker13[13-28].
Nov 19 2024, 5:57 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr moved T371435: Q1:eqiad:frack network upgrade tracking task from Racking Tasks to Decommission on the ops-eqiad board.
Nov 19 2024, 5:30 PM · SRE, fundraising-tech-ops, netops, ops-eqiad, Infrastructure-Foundations, DC-Ops
Jclark-ctr moved T370453: Q1:rack/setup/install thanos-be1005 from Racking Tasks to Remote Work on the ops-eqiad board.
Nov 19 2024, 4:12 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T370453: Q1:rack/setup/install thanos-be1005.
Nov 19 2024, 3:04 PM · SRE, SRE-swift-storage, Data-Persistence, ops-eqiad, DC-Ops
Jclark-ctr closed T377874: Q2:rack/setup/install kafka-jumbo10[16-18] as Resolved.
Nov 19 2024, 2:33 AM · SRE, ops-eqiad, Data-Platform, DC-Ops
Jclark-ctr updated the task description for T377874: Q2:rack/setup/install kafka-jumbo10[16-18].
Nov 19 2024, 2:33 AM · SRE, ops-eqiad, Data-Platform, DC-Ops

Nov 18 2024

Jclark-ctr added a comment to T379454: Degraded RAID on wikikube-worker1256.

Confirmed: Service Request 201149035

Nov 18 2024, 7:00 PM · serviceops, DC-Ops, ops-eqiad
Jclark-ctr added a comment to T379454: Degraded RAID on wikikube-worker1256.

Opened ticket with Dell Advised of i/o errors on sda and uploaded tsr report

Nov 18 2024, 6:59 PM · serviceops, DC-Ops, ops-eqiad

Nov 16 2024

Jclark-ctr updated the task description for T377878: Q2:rack/setup/install an-worker11[78-86].
Nov 16 2024, 6:06 PM · SRE, Data-Platform, ops-eqiad, DC-Ops
Jclark-ctr moved T377874: Q2:rack/setup/install kafka-jumbo10[16-18] from Racking Tasks to Remote Work on the ops-eqiad board.
Nov 16 2024, 6:01 PM · SRE, ops-eqiad, Data-Platform, DC-Ops
Jclark-ctr updated the task description for T377874: Q2:rack/setup/install kafka-jumbo10[16-18].
Nov 16 2024, 6:00 PM · SRE, ops-eqiad, Data-Platform, DC-Ops
Jclark-ctr moved T379622: wikikube-ctrl1001.eqiad.wmnet: The CMOS battery has reached the end of its usable life or has failed. from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Nov 16 2024, 5:18 PM · SRE, DC-Ops, ops-eqiad, Prod-Kubernetes, Kubernetes, serviceops
Jclark-ctr moved T379717: wikikube-ctrl1002 and wikikube-ctrl1003: Switch network cable from port 2 to port 1 on the 10G NIC from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Nov 16 2024, 5:18 PM · SRE, DC-Ops, Prod-Kubernetes, ops-eqiad, serviceops
Jclark-ctr moved T379668: PDU sensor over limit from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Nov 16 2024, 5:18 PM · SRE, DC-Ops, ops-eqiad
Jclark-ctr moved T379454: Degraded RAID on wikikube-worker1256 from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Nov 16 2024, 5:17 PM · serviceops, DC-Ops, ops-eqiad
Jclark-ctr moved T379612: decommission ganeti1010 / ganeti1013 from Backlog to Decommission on the ops-eqiad board.
Nov 16 2024, 5:17 PM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Jclark-ctr moved T380050: Decommission E/F 8 Dell switches from Backlog to Decommission on the ops-eqiad board.
Nov 16 2024, 5:17 PM · Patch-For-Review, SRE, DC-Ops, ops-eqiad
Jclark-ctr moved T379856: Degraded RAID on an-worker1169 from Backlog to Hardware Failure / Troubleshoot on the ops-eqiad board.
Nov 16 2024, 5:17 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr moved T378185: Q2:rack/setup/install wikikube-worker13[13-28] from Racking Tasks to Remote Work on the ops-eqiad board.
Nov 16 2024, 5:16 PM · SRE, ops-eqiad, DC-Ops
Jclark-ctr updated the task description for T378185: Q2:rack/setup/install wikikube-worker13[13-28].
Nov 16 2024, 5:16 PM · SRE, ops-eqiad, DC-Ops

Nov 15 2024

Jclark-ctr updated the task description for T377878: Q2:rack/setup/install an-worker11[78-86].
Nov 15 2024, 10:44 PM · SRE, Data-Platform, ops-eqiad, DC-Ops
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy