/srv/irclogs.ubuntu.com/2022/11/10/#launchpad.txt

guruprasadRikMills, ahasenack, the PPA publisher appears to be fixed now and I can see it publishing builds. Your builds could be stuck in the backlog and may need some time to be processed.07:53
RikMillsguruprasad: Thanks. I think it already caught up with the ones that mattered, but thanks for the info07:54
toidihi, I'm trying to pull a good bit of data out of launchpad and am wondering what the etiquette is. Is it possible to run a local mirror? Should I be rate limiting? What can I do to be well behaved?18:35
sarnoldhey toidi, I'm not on the launchpad team, but a few thoughts .. running a local mirror feels pretty implausible, it manages so much data that getting a mirror in the first place feels impossible. I know some of the security team's scripts have some local caching of information that's been retrieved to avoid round-trips through the APIs where we can18:37
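The local caching sarnold mentions could look roughly like the sketch below. It is an illustration only, not the security team's actual scripts: `cached_fetch`, the `fetch` callable, and the cache directory layout are all hypothetical names, and it assumes the retrieved data is JSON-serializable.

```python
import hashlib
import json
from pathlib import Path
from typing import Any, Callable


def cached_fetch(key: str, fetch: Callable[[], Any], cache_dir: Path) -> Any:
    """Return the cached JSON value for `key`, calling `fetch` only on a miss.

    `fetch` is a hypothetical stand-in for whatever API call retrieves the data.
    """
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Hash the key so arbitrary URLs or query strings become safe file names.
    digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
    path = cache_dir / (digest + ".json")
    if path.exists():
        # Cache hit: no round-trip to the remote service.
        return json.loads(path.read_text(encoding="utf-8"))
    value = fetch()
    path.write_text(json.dumps(value), encoding="utf-8")
    return value
```

Since published package records rarely change once written, a cache like this never needs to re-request an entry it already holds.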
sarnoldrate limiting seems like a very good idea indeed; once in a while I see mentions in internal channels of a client somewhere hammering a service and earning a blackhole route as a prize :)18:38
toidiOk, I understand. I'm currently using the launchpadlib package, do you know if there's a way to set that to ratelimit politely?18:38
sarnoldif you 'just' want an ubuntu archive mirror, that's an approachable problem18:39
sarnoldah sorry, no idea there :(18:39
toidiWell, what I actually want is *all* the debs18:39
toidiand ddebs... and dsc... etc18:39
toidiThe mirrors AFAICT only have recent tips, eg libssl3_3.0.2-0ubuntu1.7_amd64.deb18:40
toidithey do not have libssl3_3.0.2-0ubuntu1.*6*_amd64.deb18:40
toidiand so on18:40
toidiIdeally with build info as well18:41
toidiI understand that's a large volume of data and would be perfectly willing to just mail over a hard drive or whatever since 99% of it will never change again18:41
toidibut I imagine that's impossible18:42
sarnoldahhh that's a good challenge :) the mirrors do remove packages that aren't referenced in any of the lists.. you could do the usual rsync mirroring but skip the --delete --delete-after parts ..18:42
sarnoldit'd help collecting new stuff but couldn't help much for old stuff, and doesn't address the ddebs at all :/18:42
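A minimal sketch of the "rsync without the delete flags" idea, assuming a plain rsync-based mirror. The helper name `rsync_mirror_command` and the example source URL are hypothetical; only the `--archive`, `--delete`, and `--delete-after` rsync options are taken as given.

```python
from typing import List


def rsync_mirror_command(source: str, dest: str, prune: bool = False) -> List[str]:
    """Build an rsync argv; with prune=False, superseded packages are kept locally."""
    cmd = ["rsync", "--archive"]
    if prune:
        # The usual mirroring behaviour: drop files the upstream no longer has.
        cmd += ["--delete", "--delete-after"]
    return cmd + [source, dest]


# Hypothetical source URL, shown only to illustrate the shape of the command.
print(" ".join(rsync_mirror_command("rsync://mirror.example/ubuntu/", "/srv/archive/")))
```

Run repeatedly with `prune=False`, this accumulates every version that appears upstream from now on. As noted in the conversation, it cannot recover versions already removed, and it does not cover ddebs.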
toidiyeah, on a forward moving basis I could pull the archives and the ddeb archives18:43
toidibut this is mostly looking backwards18:43
sarnoldand with ddebs, that service feels unreliable enough that pulling from launchpad is probably more reliable in the long run18:44
toidiit certainly is nice to have them all bundled together between source, binary, and debuginfo18:44
toidiI can recreate those links with buildids and it works, but it's painful18:44
toididpkg -x grep grep grep repeat18:45
toidiso in the meantime, it sounds like just putting some small pause between requests is the way to go?18:46
toidiany sense of what's an acceptable rate?18:46
toidiand does it matter if I'm logged in or not? Not touching any restricted data or trying to write anything, but I feel like it's impolite if the entire pool of anonymous users gets some usage quota18:47
toidi(thanks for your help, btw. Sorry to pelt you with questions)18:49
sarnoldtoidi: yeah, I like the 'delays' options; I don't know quite what to suggest, but my first thought is to measure the time they take to execute and sleep twice that? that would scale up and down a bit with the load on the system..18:55
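The "measure the time and sleep twice that" suggestion can be sketched as a small wrapper. This is an assumption-laden illustration, not a launchpadlib feature: `call_politely` is a hypothetical helper, and the `min_delay` floor of 0.5 seconds is an arbitrary value added here so that very fast responses still leave a gap.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_politely(
    request: Callable[[], T],
    factor: float = 2.0,
    min_delay: float = 0.5,  # assumed floor, not from the discussion
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run one request, then pause for `factor` times as long as it took."""
    start = clock()
    try:
        return request()
    finally:
        # Sleep even when the request fails, so errors also back off.
        elapsed = clock() - start
        sleep(max(min_delay, factor * elapsed))
```

Each API call would be wrapped as `call_politely(lambda: some_request())`, where `some_request` is a placeholder for the real call. Because the pause grows with the measured response time, the client slows down on its own when the server is under load.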
sarnoldtoidi: no idea on logged in vs anon18:55
toidiok, sounds like an interesting approach. I'll see how it works. Thanks very much for your help18:57
toidiI'll reiterate that if anyone knows a way to get this data without hammering launchpad I am totally happy to use it.18:57
