/srv/irclogs.ubuntu.com/2009/08/06/#launchpad-meeting.txt

=== danilo-afk is now known as danilo
=== mrevell is now known as mrevell-lunch
=== mrevell-lunch is now known as mrevell
=== Edwin is now known as Guest73773
=== Guest73773 is now known as EdwinGrubbs
Ursinhame?16:00
sinzuiyou16:00
sinzuius16:00
intellectronicaUrsinha: no, not you16:00
sinzuithem16:00
Ursinhaah16:00
Ursinha:(16:00
matsubarasorry16:00
matsubara#startmeeting16:00
MootBotMeeting started at 10:00. The chair is matsubara.16:00
MootBotCommands Available: [TOPIC], [IDEA], [ACTION], [AGREED], [LINK], [VOTE]16:00
Ursinharoll call,roll call16:00
matsubaramy firefox died16:00
sinzuime16:00
* stub belches16:00
matsubarahang on a second please16:00
henningeme16:00
Ursinhapoor matsubara16:00
* jml eavesdrops16:01
* bigjools wafts stub's belch away16:01
rockstarni16:01
matsubaraWelcome to this week's Launchpad Production Meeting. For the next 45 minutes or so, we'll be coordinating the resolution of specific Launchpad bugs and issues.16:01
matsubara[TOPIC] Roll Call16:01
MootBotNew Topic:  Roll Call16:01
Ursinhameeee16:01
bigjoolsme16:01
matsubaraNot on the Launchpad Dev team? Welcome! Come "me" with the rest of us!16:01
stubme16:01
henningeme again16:01
intellectronicai16:01
matsubaraflacoste, hi16:01
flacosteme16:02
matsubaraherb, hi16:02
herbme16:02
matsubaraok, everyone here.16:02
matsubara[TOPIC] Agenda16:02
MootBotNew Topic:  Agenda16:02
matsubara * Actions from last meeting16:02
matsubara * Oops report & Critical Bugs & Broken scripts16:02
matsubara * Operations report (mthaddon/herb/spm)16:02
matsubara * DBA report (stub)16:02
matsubara[TOPIC] * Actions from last meeting16:02
MootBotNew Topic:  * Actions from last meeting16:02
matsubara * matsubara to chase rockstar about failure on updatebranches script16:02
matsubara * stub to give a try on bug 354593 with mars help if needed16:02
matsubara * stub to fix bug 31081816:02
matsubara * mars to take a look at OOPS-1307J1616:02
matsubara * Discuss the solution proposed by gary_poster after the meeting, about ExpatErrors and bug 40360616:02
matsubara * mars and stub to discuss the Disconnection and OperationalErrors after the meeting16:02
jmlme16:02
ubottuLaunchpad bug 354593 in launchpad-foundations "SSO exceptions views need proper branding" [High,Triaged] https://launchpad.net/bugs/35459316:02
ubottuLaunchpad bug 310818 in launchpad-foundations "Oops report does not always log timed-out query" [High,In progress] https://launchpad.net/bugs/31081816:02
ubottuhttps://lp-oops.canonical.com/oops.py/?oopsid=1307J1616:03
Ursinhayay, jml16:03
ubottuLaunchpad bug 403606 in launchpad-registry "ExpatError errors should be handled to not generate the OOPSes" [Undecided,New] https://launchpad.net/bugs/40360616:03
matsubaraI suck, I didn't chase rockstar about the updatebranches script failures16:03
rockstarmatsubara, I thought we agreed that mwhudson would be better to chase on it.16:03
jmlmatsubara, got a URL for the failure?16:03
matsubaraotoh, the script is not failing anymore...16:03
rockstarmatsubara, I know mwhudson was looking at it on his Tuesday.16:03
rockstarjml would be good to ask as well.16:03
=== cprov is now known as cprov-lunch
matsubararockstar, all right. I'll talk to jml and mwhudson later on today16:04
matsubara[action] * matsubara to chase mwhudson/jml about failure on updatebranches script16:04
MootBotACTION received:  * matsubara to chase mwhudson/jml about failure on updatebranches script16:04
rockstarmatsubara, jml is here right now. :)16:04
matsubarajml, I'll get you an url for the scripts after the meeting. I need to trawl my emails to find it16:04
jmlmatsubara, ok. thanks.16:04
matsubarastub, how's 354593 fix coming along?16:05
flacostewhy is this High again?16:05
matsubaraI wonder if mars had time to look over OOPS-1307J1616:06
ubottuhttps://lp-oops.canonical.com/oops.py/?oopsid=1307J1616:06
matsubaraflacoste, do you know ^?16:06
flacostehmm, i put it as such16:06
flacosteany reason it should be?16:06
matsubaraflacoste, according to the bug history you made it high :-)16:06
flacostedebranding of the SSO is a U1/ISD affair anyway16:06
stubmatsubara: Slow. I need to discuss with people how to actually do it - maybe next week on the sprint if I get time.16:06
* sinzui agrees with flacoste16:07
flacostestub: i think we should try to get stu and James to do it :-)16:07
flacosteespecially, stu, it would be a test good case for transfer knowledge16:07
stubAnything that means I don't have to work out how ZPT macros works is fine by me.16:07
flacoste+116:07
matsubaraUrsinha, what's up with  "Discuss the solution proposed by gary_poster after the meeting, about ExpatErrors and bug 403606"?16:08
ubottuLaunchpad bug 403606 in launchpad-registry "ExpatError errors should be handled to not generate the OOPSes" [Undecided,New] https://launchpad.net/bugs/40360616:08
Ursinhamatsubara, the ExpatErrors were being discussed by mars and gary16:08
gary_postermatsubara: that;s now registry.  it actualy is a legitimate oops16:08
matsubara[action] stub to delegate bug 354593 to ISD16:08
ubottuLaunchpad bug 354593 in launchpad-foundations "SSO exceptions views need proper branding" [High,Triaged] https://launchpad.net/bugs/35459316:08
MootBotACTION received:  stub to delegate bug 354593 to ISD16:08
gary_posterit indicates a problem with mailman integration16:08
sinzuiI will ask barry to look into bug 40360616:09
ubottuLaunchpad bug 403606 in launchpad-registry "ExpatError errors should be handled to not generate the OOPSes" [Undecided,New] https://launchpad.net/bugs/40360616:09
matsubarastub, You recently fixed a DisconnectionError bug. was it related to the errors you discussed with mars? that action item is now done?16:09
matsubarathanks sinzui and gary_poster16:09
gary_poster:-)16:10
stubmatsubara: I landed code to log OOPS reports on DisconnectionError before retrying the request. Is that what you mean?16:10
matsubarastub, I mean: "* mars and stub to discuss the Disconnection and OperationalErrors after the meeting"16:10
Ursinhastub, is that what caused the TransactionRollbackError oopses?16:11
stubWe discussed. I don't recall much about the conversation though :)16:11
matsubara:-)16:11
stubUrsinha: That fix was, yes. I've got another branch that turns the volume down so we don't log the TransactionCommitError's16:11
matsubara[action] sinzui to ask barry to fix bug 40360616:12
ubottuLaunchpad bug 403606 in launchpad-registry "ExpatError errors should be handled to not generate the OOPSes" [Undecided,New] https://launchpad.net/bugs/40360616:12
MootBotACTION received:  sinzui to ask barry to fix bug 40360616:12
Ursinhastub, good, I filed bug 409907 for that16:12
ubottuLaunchpad bug 409907 in launchpad-foundations "TransactionRollbackErrors may prevent us to detect real issues" [Undecided,New] https://launchpad.net/bugs/40990716:12
matsubaraUrsinha, is there a bug for OOPS-1307J16?16:13
ubottuhttps://lp-oops.canonical.com/oops.py/?oopsid=1307J1616:13
Ursinhamatsubara, not that I opened one, because we needed to know what was going on over there16:13
Ursinhato open the bug16:13
Ursinhaso mars was going to investigate that16:13
UrsinhaI don't recall having those anymore16:15
matsubara[action] ursinha to chase mars about OOPS-1307J16 and file a bug about it16:15
ubottuhttps://lp-oops.canonical.com/oops.py/?oopsid=1307J1616:15
MootBotACTION received:  ursinha to chase mars about OOPS-1307J16 and file a bug about it16:15
ubottuhttps://lp-oops.canonical.com/oops.py/?oopsid=1307J1616:15
matsubaraI think that's all for last meeting's action items16:15
matsubarathanks everyone16:15
matsubara[TOPIC] * Oops report & Critical Bugs & Broken scripts16:15
MootBotNew Topic:  * Oops report & Critical Bugs & Broken scripts16:15
Ursinhathere are two issues to discuss16:16
Ursinhaone was about bug 409907, that I already mentioned to stub and it's being handled16:16
ubottuLaunchpad bug 409907 in launchpad-foundations "TransactionRollbackErrors may prevent us to detect real issues" [Undecided,New] https://launchpad.net/bugs/40990716:16
Ursinhathe other is about the select replication_lag() timeouts we're having16:16
Ursinhamthaddon also reported problems that we don't know if are related to that16:17
UrsinhaI don't know if there's much to be discussed at this point, because it seems we need to fix oops reports first to be able to see the real problem here16:18
Ursinhais that correct stub:16:18
Ursinha?16:18
matsubarashould we request a CP for the branch that fixes the oops log?16:18
intellectronicagiven that we're skipping a release, that's probably a good idea16:19
Ursinhaflacoste, I've spoken with jtv yesterday about those,and he also said that was unlikely to be his changes fault (possible but unlikely)16:19
stubI landed code today that should tell us more about if the timeout is actually occuring due to blocking on the database, or elsewhere.16:20
Ursinhas/his/translations/16:20
Ursinhastub, should we request a CP?16:20
flacosteyeah, i really think a CP is a good idea16:20
Ursinha(please please)16:20
matsubara[action] stub to request CP for his branch that fixes oops logging16:20
MootBotACTION received:  stub to request CP for his branch that fixes oops logging16:20
Ursinhacool16:20
Ursinhawe have two critical bugs, already fix committed16:21
Ursinhaso, good16:21
matsubaracool16:21
Ursinhaabout the failing scripts16:21
matsubarawe had some scripts failing this week16:21
matsubaranightly, productreleasefinder and garbo-hourly16:21
matsubaraand rosetta-poimport too16:22
matsubaranightly was already addressed by jtv16:22
Ursinhamatsubara, productreleasefinder isn't expected to fail anymore? sinzui?16:22
matsubaraas a rosetta script was taking too much time and jtv will remove it from nightly and add a cronjob for it16:22
sinzuiUrsinha: no, but the errors is see are not failures...the script was not run16:22
matsubarastub, do you know why garbo-hourly is failing?16:23
stubIts failing?16:23
sinzuimatsubara: many scripts are not running because of one log process16:23
matsubarahenninge, rosetta-poimport failed on the 5th. can you investigate and reply to the list?16:23
sinzuis/log/long/16:24
Ursinhamatsubara, it's not being run, it seems16:24
henningematsubara: sure, I will.16:24
matsubarastub, I got a few emails: "Scripts failed to run: loganberry:garbo-hourly"16:24
sinzuiUrsinha: matsubara there is some traffic about this. spm reported the long running prcess a weeks ago. I has asked why the prf had not run16:24
matsubaraand no replies to the list, so I'm asking here16:24
matsubarathanks henninge16:25
Ursinhamatsubara, actually stub repklied16:25
Ursinha*replied16:25
stubOh - there were some blocked runs because the rosetta export-to-branch script was running in a 5 hour long transaction16:25
stubSo the script blocks because it doesn't want to make anything worse.16:25
matsubara[action] henninge to investigate rosetta-poimport script failure on the Aug 5th and report back to the list16:25
MootBotACTION received:  henninge to investigate rosetta-poimport script failure on the Aug 5th and report back to the list16:25
=== salgado is now known as salgado-lunch
Ursinhaso I guess it's ok16:27
Ursinhathat's all for this section16:28
Ursinhafrom me16:28
Ursinhathanks everyone16:28
Ursinha!16:28
Ursinhayou can move on matsubara16:28
matsubaraall right. thanks everyone16:28
matsubara[TOPIC] * Operations report (mthaddon/herb/spm)16:28
MootBotNew Topic:  * Operations report (mthaddon/herb/spm)16:28
herb2009-07-31 - Rolled out r8323 to bzrsyncd16:28
herb2009-08-05 - Cherry picks for code imports, lpnet* and the script server.16:28
herbOur monitoring system has been timing out in connecting to the app servers more often this week. Admittedly its timeout is set lower than the OOPS timeout. But we've also been noticing higher load on the app servers as well. This was discussed by Ursinha during the oops/critical bugs/broken scripts section.16:29
herbThere's currently 1 cherry pick and 1 database query awaiting (dis)approval.16:29
herbThe LOSAs currently have 14 bugs marked high and triaged. Only 1 of which is assigned to someone and targeted for a release. We would be grateful if we saw some movement on these.16:29
herbWe're currently running with a single slave in preparation for the sprint next week.16:29
mthaddonalso wanted to check that there should be a cherry pick request for the cowboyed storm change to lpnet9 and lpnet10 (per the production status wiki page)16:30
flacostecowboyed storm change?16:30
mthaddonflacoste: https://pastebin.canonical.com/20503/ under eggs/storm-0.14salgado_storm_launchpad_288_308-py2.4-linux-i686.egg16:30
flacostemthaddon, herb: i'll look at the LPS to approve/decline16:30
flacosteright16:31
flacostemthaddon: the cherry pick would simply be to update that dependency16:31
matsubaraherb, do you keep that list of 14 bugs somewhere? in a wiki page or have a tag to group them?16:31
herbmatsubara: bugs.launchpad.net/~canonical-losas16:31
mthaddonflacoste: well in any case, the CP that was requested (and performed) yesterday overwrote it, so it needs to be formalised so other CPs don't overwrite it again16:32
flacostesinzui: can salgado makes an appropriate CP request?16:32
sinzuiYes16:32
flacosteit's simply a new upload to download-cache with a versions.cfg change16:33
matsubarasinzui, flacoste, intellectronica, rockstar: Could you take a look at herb's bug list (bugs.launchpad.net/~canonical-losas) and see what your teams can do about the high ones in the short term?16:34
flacosteok16:35
herbclearly we're not looking for all of them to be fixed by the next meeting (though that would be great ;)16:35
herbjust mostly would like to know they're staying on the right radars and are being worked on as appropriate.16:35
matsubaracool16:36
matsubaraanything else for herb?16:36
intellectronicaherb: so, basically, these are mostly bugs which will make life easier for you when fixed?16:36
sinzuibug 348722 should become invalid when we update all pmt teams to become true private teams16:36
ubottuLaunchpad bug 348722 in launchpad-code "Set default branch visibility to "forbidden" if any team set to 'Private'" [High,Triaged] https://launchpad.net/bugs/34872216:36
herbintellectronica: some of them are geniune operational issues, some of them are quality of life issues for the LOSAs16:37
sinzuiThere should be no private-membership teams at the start of week 116:37
intellectronicacool, sure, we'll take a look and see if there's any low hanging fruit16:37
sinzuibarry will be working with the losas on August 11 to fix bug 32596216:38
ubottuLaunchpad bug 325962 in launchpad-registry "lp-mailman startup is blocking on a pid file in the wrong directory" [High,Triaged] https://launchpad.net/bugs/32596216:38
herbsinzui: that was the one that was assgned and targetted at a release.16:38
sinzuiherb, many times16:39
herbassigned even16:39
herbheh16:39
matsubaraall right. I think that's it16:39
sinzuiherb it failed my rules that bug is not high if it is not worked on by all parties in 3 months16:39
herbthanks16:39
matsubarathanks herb and everyone16:39
matsubara[TOPIC] * DBA report (stub)16:39
MootBotNew Topic:  * DBA report (stub)16:39
stubWe set off some alerts when the poimport script and PostgreSQL decided that lots of disk space should be used. We see some smaller spikes, which is just PG using disk to store intermediary results, but this time it was large enough to set of the alarms.16:40
stubWe have seen this once before, and in neither case have we been able to repeat it. My best hypothesis is the planner statistics triggering a really bad query plan, so I'll bump the planner statistic sample size on the production dbs in case this stops future occurances.16:40
matsubarahenninge, maybe the last rosetta-poimport failure was related to that ^16:41
henningematsubara: I believe we already know what it was about and it may be related to that.16:43
henningematsubara: I'll talk to the guys.16:43
matsubarahenninge, cool. thanks16:43
matsubarastub, anything else?16:43
stubNot that I can think of16:43
matsubaraall right. thank you stub16:43
matsubaraI guess that's all for today16:43
matsubaraThank you all for attending this week's Launchpad Production Meeting. See https://dev.launchpad.net/MeetingAgenda for the logs.16:44
matsubara#endmeeting16:44
MootBotMeeting finished at 10:44.16:44
herbthanks everyone16:44
Ursinharight on time16:44
matsubara:-)16:44
Ursinhathanks guys16:44
=== matsubara is now known as matsubara-lunch
=== salgado-lunch is now known as salgado
=== cprov-lunch is now known as cprov
=== matsubara-lunch is now known as matsubara
=== maxb_ is now known as maxb
=== salgado is now known as salgado-afk

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!