/srv/irclogs.ubuntu.com/2021/10/04/#ubuntu-security.txt

=== grimmetywares is now known as grimmware
=== ChanServ changed the topic of #ubuntu-security to: Twitter: @ubuntu_sec || https://usn.ubuntu.com || https://wiki.ubuntu.com/SecurityTeam || https://wiki.ubuntu.com/Security/Features || Community: amurray
sarnold_teward: jeeeeze, that's an unhappy system and unhappy sysadmin..18:07
=== sarnold_ is now known as sarnold
teward'grumpy' sysadmin is the proper wording18:08
tewardi also didn't sleep well so i'm extra grumpy today18:08
teward:P18:08
JanCtalking about bad updates: the person who pushed that BGP update to the routers in Facebook's DCs is probably looking for flights out of the country by now  :P18:19
tewardhah18:36
tewardwe sure it's BGP updates that took it out?  I mean, probably was, but still xD18:37
teward(there's nowhere on the planet they can hide, their flight better be a SpaceX)18:37
JanCfrom a FB tech on reddit (now deleted again):18:37
JanC"""As many of you know, DNS for FB services has been affected and this is likely a symptom of the actual issue, and that's that BGP peering with Facebook peering routers has gone down, very likely due to a configuration change that went into effect shortly before the outages happened (started roughly 1540 UTC).18:38
JanCThere are people now trying to gain access to the peering routers to implement fixes, but the people with physical access are separate from the people with knowledge of how to actually authenticate to the systems and people who know what to actually do, so there is now a logistical challenge with getting all that knowledge unified.18:38
JanCPart of this is also due to lower staffing in data centers due to pandemic measures."""18:38
JanCalso: allegedly they normally use FB Messenger for internal communications18:39
tewardso basically, FB screwed themselves xD18:40
sarnoldit's super-helpful to have a channel on oftc or libera or maybe even both :)18:42
tewardor a Slack for them xD18:42
tewardbut you're not wrong18:42
JanCthey already apologised on Twitter18:42
JanCthat must have hurt  :P18:43
JanCteward/sarnold: now you assume their internal network routing to Slack servers is still up...18:43
tewardaccurate statement18:44
JanCor to IRC18:44
tewardwell if they fubar'd their network THAT badly with bad BGP routes they failed hard xD18:44
JanCmobile phones probably still work, but maybe not inside the DC18:44
sarnoldJanC: easy peasy, cell phone tether18:45
JanCI guess you could set up some route over a mobile phone to the internal network inside each DC, once you can log into the router, but the person with the password/key for that is maybe 500km away   :)18:47
JanCI'm sort of surprised they don't have some sort of "hardcoded" route into their DCs...18:48
sarnolda modem with serial port ..18:55
JanC"""Was just on phone with someone who works for FB who described employees unable to enter buildings this morning to begin to evaluate extent of outage because their badges weren’t working to access doors."""19:00
JanCimagine not getting into the DC either  ;)19:01
tewardheheh whoops xD19:01
JanChello, can we rent a bulldozer from you?19:01
tewardaccurate19:08
JanC<allie> process control failure led to applying a too aggressive export filter. the routers complied, stopped announcing routes to the internet, and FB's OOB network management fell over because it had a sneaky dependency on the rest of FB's network20:28
JanCso they *did* have a "hardcoded" independent control route... except it wasn't as independent as they thought!  :P20:29
