/srv/irclogs.ubuntu.com/2009/03/16/#ubuntu-kernel.txt

rtgall ext4 crack-heads follow Ted ==> https://bugs.edge.launchpad.net/ubuntu/+source/linux/+bug/317781/comments/17512:49
ubot3Malone bug 317781 in linux "Ext4 data loss" [High,Fix released] 12:49
dandelrtg, you there?14:41
dandeli need to find out what i am supposed to do to set... CONFIG_ACPI_DEBUG and does this require a kernel compile?14:42
rtgdandel: I expect it will14:43
dandelack.14:43
dandelnow what do i ened to install to compile 2.6.29rc8 or 2.6.29rc7?14:44
* dandel is following the maintainers instructions but doesn't like kernel compiles.14:44
rtgdandel: build-essential fakeroot, etc. See https://wiki.ubuntu.com/KernelMaintenance14:45
dandelhmm14:48
dandelmakedumpfile is missing14:48
dandeloh... hardy is incomplete lol14:49
dandelok... got kernel source and whatnot installed, now what?14:52
rtgdandel: if you use 'debuild -b' then all build assumptions are checked. It'll tell you whats missing14:52
rtgof course, you must first install devscripts14:52
dandelhow do i get that?14:52
rtgthe usual way, 'apt-get install devscripts'14:53
dandelalready got it14:53
dandelall i want to do is rebuild the acpi module with debug options.14:53
rtgdandel: you can build a binary deb for just the flavour you want, 'fakeroot debian/rules build-debs flavours=generic'. Other then that, there aren't many short cuts14:55
rtgbuild-debs ==> binary-debs14:55
dandelgoing to take 5 or 6 min to get the extracted src14:56
dandeloh wait, it finished already lol14:57
dandeldebian/rules not found.14:58
loolamitk: w0àt15:01
loolthat was a woot, but mistype, anyway15:01
loolamitk: NEON can cause babbage to deadlock due to hw bugs15:01
loolamitk: Could you please turn CONFIG_NEON off in imx51?15:01
loolamitk: This will be fixed in hardware updates; we don't know when we'll get them, but since we target the babbage development for now and the release anyway, we shouldn't take the risk; I don't think we'll get the new hardware soonish and others might run into it15:02
loolamitk: This is recommended by Dave Martin from ARM15:02
loolrtg: ^ You might be interested as well15:02
rtglool: your 'Add HWCAP_NEON to the ARM hwcap.h file' won't work worth a hoot if you disable CONFIG_NEON15:04
loolrtg: Crap15:04
loollskdgkjlqzhkgl hlq15:04
loolrtg: That makes sense; but it's kind of a stupid situation15:05
rtglool: its all protected with '#ifdef CONFIG_NEON'15:05
loolrtg: Yes, I didn't think of that15:06
loolrtg: I've asked dave_m to join here; he requested both changes which are contradictory15:06
rtglool: I'll let you and amitk duke it out.15:06
loolrtg: Thanks15:07
* amitk would like to take this opportunity to reiterate - We shouldn't apply crack in such a hurry w/o testing15:07
* rtg feels like he's been bitch slapped :)15:08
dandelhmm... where do i find the ubuntu upstream git repo for 2.6.29rc815:09
rtgdandel: there isn't one. apw has a mainline build script that synthesizes a debian build from the upstream kernel git repo15:10
apwdandel, yep, in essence we rip the source tree out of our reposity and graft in the offical tree from linus' repo15:11
apwso the official source for the kernel is his code at that level15:11
apwwe then push our kernel configuration in and build packages from it15:12
apwthere are source packages for the trees in there15:12
dandelblah... i like binary build... a maintainer asked me to set a param and i don't want to build the whole kernel to do it.15:12
dandelapw, look at this... http://bugzilla.kernel.org/show_bug.cgi?id=12873 15:13
ubot3bugzilla.kernel.org bug 12873 in Config-Interrupts "ACPI_IRQ not set" [Normal,Needinfo] 15:13
dandelif i could figure how to set config_acpi_debug without having to build the whole kernel it'd be great.15:13
apwdandel, thats a pretty core so i think you'd have to rebuild most things to set that anyhow15:14
apwhave you checked if it is set already?15:14
dandelit's not15:14
dandelnot in the one that's in mainline tree15:14
apwhrm, no its not in  our standard configs, so it'd not be15:15
dandel2.6.29-020629rc8-generic does not have that option set.15:15
mjg59dandel: You can't15:15
mjg59dandel: acpi is an integral part of the kernel. Enabling debug will force most of the kernel to be rebuilt.15:15
apwyou are pretty much forced to rebuild15:15
dandel><;15:15
dandelcan i get a acpi debug enabled build then?15:16
apwi take it it has to be 2.6.29-rc715:16
dandel2.6.29rc8 has same dmesg output15:16
dandeli don't think it'll matter as long as i get the log up and point to which version of kernel it is.15:17
apwdandel, i take it you are not keen to build one15:17
dandeli haven't done it before, and last few times i royally screwed up the machine doing it.15:18
apwhrm15:18
dandelkernel builds take up a lot of space, and i don't have that to spare.15:18
* apw has a look15:18
dandellast i checked, it takes about 5 to 6gb to build the kernel.15:19
apwi'd say about 1gb, but not tiny15:19
apwwhats your platform?  32bit 64 bit?15:19
dandel3215:19
* rtg gloats about his 4 spindle 2TB build server15:20
dandelwith the dmesg of... 2.6.29-020629rc8-generic ... should of been obvious.15:20
dandeli believe the 64 bit version has a version of somethin like... 2.6.29-020629rc8-amd6415:20
apwnope: 2.6.28-8-generic15:20
apwand i can shave 50% off by asking15:21
looldave_m_: Hey15:21
loolamitk: Dave Martin == dave_m_ above15:21
loolamitk: it's not crap and it's useful15:21
loolrtg: ^15:22
dandelapw, at least i knew enough to check where the bugs where, which is better than 90% of people who just install to run programs.15:22
apwindeed15:22
loolamitk, rtg: At least at the source level, we can rebuild the pristine jaunty kernel when new hardware comes out15:22
apwdandel, is there an ubuntu bug for this?15:22
loolamitk, rtg: Now, according to dave_m_, the NEON code used by the kernel itself wont trigger the hang15:22
dandelyes15:22
apwwahts the #, can tie the kerenls to that15:23
loolamitk, rtg: So it should be safe to keep it enabled IIUC; problematic NEON code triggering the hang is in ffmpeg and in pixman, so in userpsace15:23
dandelbug # 33870115:23
dave_m_lool: I would need to double-check that.15:23
loolamitk, rtg: CONFIG_NEON or not doesn't change the fact that these could deadlock the platform15:23
rtglool: works for me15:23
looldave_m_: please do15:23
apwbug #33870115:23
ubot3Malone bug 338701 in linux "acpi_irq is not set properly." [Undecided,Incomplete] https://launchpad.net/bugs/33870115:23
dandelit has a sister bug namely bug #29432315:24
ubot3Malone bug 294323 in linux "Special Function keys broken after upgrade ( Toshiba Satilite P305D, 2.6.27 kernel) (dup-of: 261318)" [Undecided,New] https://launchpad.net/bugs/29432315:24
ubot3Malone bug 261318 in linux "Regression: new Toshiba Laptop Support (tlsup) driver breaks Toshiba hotkeys; input device does not support 'kbd' input handler" [High,Fix released] https://launchpad.net/bugs/26131815:24
loolrtg, amitk: So what dave_m_ was also proposing is to check with FSL how we can identify the problematic boards; would you be ok to merge a patch disabling/enabling NEON based on the baord?15:24
loolrtg, amitk: The hwcaps changes and the NEON disablement are really orthogonal; it's just unfortunate that the only *hardware* that we have is broken; would we have working hardware (which we'll likely have later), or another supported NEON platform (e.g. beagleboard), we wouldn't face this contradictory situation15:25
dandelalthough, that bug got semi-fixed, but the suspend/shutdown, power plug status and battery updates where all messed up15:26
loolrtg, amitk: I think if we get confirmation that NEON in the kernel works, we don't need to do anything to the config and you can just ignore my request to disable it; I'm sorry I only learnt later today that NEON was causing deadlocks on iMX5115:26
rtglool: you and amitk work it out. I'll already been smacked once today.15:27
looldave_m_: In all cases, kernel patch or not, I think it's useful to recognize problematic hardware so that we can at least assist people coming with random hangs15:27
dave_m_Apologies, it was me not realising that iMX51 is the only official target with NEON support at present.15:27
loolrtg: I'm sorry about that :-(15:27
loolrtg: Didn't know about the hardware issues earlier today, this is news to me15:27
dave_m_Just running a kernel with NEON support built in doesn't cause problems. So maybe we can split the discussion into kernel and userspace.15:29
dandelapw, mind walking me through building just enough of the kernel to boot up to the console for a dmesg report?15:30
looldave_m_: So we don't need to disable CONFIG_NEON?15:30
apwdandel, i was just working on a build for you now15:30
dandeloh, thanks :D15:31
dave_m_CONFIG_NEON by itself is not a problem.  It's only if some app tries to use NEON that the problems can occur.15:31
JayFopgraner, finally here now. stupid restricted network15:32
JayFoI'll tell you later15:32
pgranerJayFo: lol, glad to have ya15:32
JayFo:)15:32
JayFothanks15:32
dandelhmm... actually, i wonder how severe my bug really is... lol. (besides knocking out the power management on my laptop)15:33
dandeli had to debate between 2.6.24 and 2.6.27+ because 2.6.27 had proper cpu throttling on my laptop.15:33
dandeloh, how should i report bugs with the ubuntu hybernate, because the windows based installer and such for ubuntu leads to unusable hybernate.15:38
apwall bugs should go into launchpad15:39
dandelif i hybernate i get a long long dmesg log due to the fact it's trying to fit 3gb of ram in to a 1gb swap.15:39
dandelit's an installer bug from what i can tell.15:40
apwin what sense?  that it should have made a bigger swap?15:40
dandelit should let me do a custom partition table for starters15:41
apwsounds like an installer bug yes15:41
dandel8.04.2 autoconfigured everything.15:41
dandeland thus i couldn't even tell it that the swap needed to be at least 3gb15:41
dandel8.10 is the same, as for 9.0x i couldn't even get the windows based installer to even run.15:42
apwboth of those sound like they need reporting15:42
dandelif i knew which part of the launchpad to put i would of15:43
dandelanyways, i had to blacklist the ath5k driver due to the fact it can't even work right ><; doesn't detect my wap which is less than 3 feet away.15:44
amitklool: rtg: I still stick by my initial statement. It shouldn't have gone in, in the first place. There are several people who have the board that could test it (even in my absence). cooloney is in the China TZ and could've helped. The reason I am bitching is that it causes extra work. Apply-upload-test-explode-revert-upload isn't my ideal workflow.15:47
loolamitk: What does?  The NEON hwcap?15:47
amitklool: yeah15:48
loolamitk: Why so?15:48
cooloneyamitk, lool sorry i'm not get the whole story15:48
dandelhmm... i assume kernel panics are critical bugs right? ( i have a friend which on one of the newer kernels gets panics every 5 min on his laptop.15:49
amitklool: because of all the discussion that has been going on. Instead, the patch could've been emailed, tested and only then applied.15:49
loolamitk: I currently need it to demonstrate NEON support in ffmpeg in jaunty; yes, it's a new feature15:49
loolamitk: The patch is ok15:49
loolamitk: It's not wrong in any case15:49
amitklool: you claimed it was15:49
loolNo15:49
amitk17:01 < lool> amitk: NEON can cause babbage to deadlock due to hw bugs                                                                                                         gnomefreak    15:50
amitk17:01 < lool> amitk: Could you please turn CONFIG_NEON off in imx51?     15:50
loolamitk: That's unrelated to the patch I sent15:50
loolamitk: The patch I sent is to enable NEON 15:50
lool*hwcaps*15:50
loolNot CONFIG_NEON15:50
loolamitk: CONFIG_NEON is currently turned on already15:50
loolamitk: I think you're mixing the two, they are completely orthogonal15:51
gnomefreakwhat did i do?15:51
loolamitk: Does this clarify?15:52
loolamitk: I logically need NEON support in all software bits, but the hardware is broken; all NEON support around is correct, and even the binary kernel is ok if you get a new babbage board which doesn't deadlock in some code present in ffmpeg and pixman15:54
dave_m_lool: Is it reasonable to enable the infrastructure for NEON support (CONFIG_NEON, HWCAP_NEON and optionally ld.so hwcap support), but to avoid software which uses NEON for this release? It still feels valuable, because people can develop against the infrastructure when suitable platforms are available.15:57
amitklool: perhaps I was mixing them up. In which case apologies.15:59
loolamitk: No problem, I do agree with you that the hurry was a bad idea, but it's close to beta and I wanted to meet our goals, even if the technical changes are understood only so late   :-/16:00
loolamitk: I feel bad to have put everybody on the nerves about this; I'll do my best to make everything as smooth as possible; concerning NEON we don't need any other change in the tree now; the patches which rtg merged this morning were correct and useful16:01
loolNo need to change CONFIG_NEON either, that part was wrong from me16:02
rtglool: s'ok, I had not gotten around to it yet anyway16:02
amitklool: ack16:02
loolrtg: Do keep your hwcaps changes though  ;-)16:03
rtglool: both patches are in the repo16:03
loolrtg: Saw them, that's great, thanks a lot16:03
dandelapw, how's the build going along?16:33
apwits building now, i had some tooling issues, not tried to build a mainline kernel locally since they were automated and i've broken it along the line16:33
dandeloh fun.16:34
apwdandel, the kernels images you needed should be uplaoded in a couple of minutes, link in the bug18:05
=== anubhav is now known as anubhav_away
dandelapw, ok, i'll get em in about in about 15 min, just to make sure they are completed.18:22
apwthey are done my end18:22
dandelk18:23
dandeli'll put up the log asap.18:26
dandeljust haft to wait for initramfs.18:27
dandellibc 2.6 vs libc 2.7 lol... nice little complain lol.18:28
Keybukrtg: around?18:34
rtgKeybuk: I'm a square18:34
Keybuka square will do fine18:34
Keybukneed an opinion on bug #29671018:34
ubot3Malone bug 296710 in linux "warning: ehci_hcd loaded AFTER uhci_hcd and ohci_hcd" [Undecided,Confirmed] https://launchpad.net/bugs/29671018:34
KeybukI've been doing some investigation, and it turns out that the three drivers have no overlapping modules aliases18:35
Keybukin fact, they each export exactly one18:35
Keybukehci_hcd  pci:v*d*sv*sd*bc0Csc03i20*18:35
Keybukohci_hcd  pci:v*d*sv*sd*bc0Csc03i10*18:35
Keybukuhci_hcd  pci:v*d*sv*sd*bc0Csc03i00*18:35
rtgright, which is one of twe reason I've been confused why link order makes any difference18:35
rtgor load order18:35
Keybuknow, because they don't overlap - there's nothing we can do in userspace18:36
Keybukmodules.order won't help18:36
dandelapw, no log change.18:36
Keybukif we find the USB-1 hub first, [uo]hci_hcd will be loaded before ehci_hcd18:36
apwdandel, you should have a .config with it in /boot18:36
Keybukand that can happen if the laptop has a USB-1 hub, and someone plugs in a USB-2 pccard18:36
apwcan you confirm the entry is in there18:36
Keybukbut I don't understand why there's a warning in the kernel anyway?18:37
rtgKeybuk: can we modprobe it in initramfs ?18:37
Keybukif the modaliases don't overlap18:37
Keybukthen why does it matter?18:37
Keybukrtg: if that's the kind of fix we need, build it into the kernel!18:37
rtgKeybuk: lemme look at the code.18:37
Keybukk,18:37
Keybukbbiab tea18:37
dandelit is set18:38
dandelbut, config_acpi_debug_func_trace is not18:38
dandeli'll put the log on the main kernel.18:39
apwi thought they only asked for the former18:40
apwbut yep stuff it up and see waht they ask for 18:40
apwnext18:40
dandeldone, and should i place it on the launchpad too?18:42
dandelapw, i think your on intrepid :)18:43
apwdandel, yeah put it on there too may as well, helps keep us informed18:43
apwdandel, why so?18:43
dandellaptop is set to hardy18:44
apwi built those images in an intrepid chroot, but the machine was running jaunty18:44
dandelit's got a long string of error due to that ><;18:45
dandelhowever, that's long after the irq is disabled.18:46
apwdandel, odd18:48
dandelhere's the log... http://launchpadlibrarian.net/23949351/dmesg_2.6.29-020629rc8-generic.log18:48
dandelline by line dump as follows... [    2.524473] Pid: 1, comm: swapper Not tainted 2.6.29-020629rc8-generic #118:48
dandelthat's where it starts... i won't past much more since it's over 15 lines.18:49
dandelit ignores my dsdt in the bios18:49
dandelhowever, that's not new.18:50
apw[    0.000000] Unknown boot option `acpi_debug.layer=0x44': ignoring18:51
apw[    0.000000] Unknown boot option `acpi_debug.level=0x08000004': ignoring18:51
apwdoesn't look like the debug is turned on in that boot to me18:51
dandeli set the parameters :D18:53
dandelpfft18:55
dandeli found it18:56
dandelhe should of said... acpi.debug_level18:56
apwdandel, yep, just about to say the very same18:56
dandelyay... now the spam begins.18:57
dandelev_gpe_detect18:58
dandeland ev_fixed_event_detect are large parts18:58
dandel80 seconds of spam b4 it got done18:58
rtgKeybuk: I wonder what the downside would be to build in ehci? I can't see a hardware reason for the warning, just vague warning in google articles that bad things could happen, or that existing USB 1.0 devices will get disconnected when a 2.0 device is inserted.18:59
* dandel adds new log shortly.18:59
IntuitiveNipplertg: I was able to reproduce that on a PC here at one time. Would it help if I figured out which one so we can work on an effected system?19:00
rtgIntuitiveNipple: can you remember how you did it? Insert USB 1.0 device, then a 2.0 HUB ?19:01
dandeluhh... apw... log is too long19:01
dandelit cut out huge parts of it19:02
apwheh nice19:02
IntuitiveNipplertg: I *think* it was just the 'default' boot-up ... I didn't *know* there was a problem until an external USB2 hub with 160G USB2 drive began behaving slowly... then the slowness 'went away' so I never really confirmed the issue 19:02
dandel[  190.652052]    evgpe-0444 [00] ev_gpe_detect         : Read GPE Register at GPE18: Status=01, Enable=80 to [  214.452038]    evgpe-0444 [00] ev_gpe_detect         : Read GPE Register at GPE18: Status=01, Enable=8019:02
dandelthat takes 1.1k lines19:02
rtgIntuitiveNipple: so, its likely you either have a 1.0 hub built in to the laptop, or only have 1.0 devices connected initially.19:03
dandelhmm.19:03
IntuitiveNippledandel: apw: FADT: X_PM1a_EVT_BLK.bit_width (16) does not match PM1_EVT_LEN (4)   ... not necessarily related but...19:03
IntuitiveNipplertg: Yes, it has both USB1 and USB2 hubs... USB1 seem to be used for the internal USB devices19:04
dandelhmm.... i better copy the source file in /var19:04
IntuitiveNipplertg: (four lspci lines follow)19:04
IntuitiveNipple00:1d.0 USB Controller [0c03]: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #1 [8086:27c8] (rev 02)19:04
IntuitiveNipple00:1d.1 USB Controller [0c03]: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #2 [8086:27c9] (rev 02)19:04
IntuitiveNipple00:1d.2 USB Controller [0c03]: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #3 [8086:27ca] (rev 02)19:04
IntuitiveNipple00:1d.3 USB Controller [0c03]: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #4 [8086:27cb] (rev 02)19:04
IntuitiveNipple00:1d.7 USB Controller [0c03]: Intel Corporation 82801G (ICH7 Family) USB2 EHCI Controller [8086:27cc] (rev 02)19:04
rtgIntuitiveNipple: hmm, so it _did_ load the ehci driver, but just not in the right order. correct?19:05
IntuitiveNipplertg: actually, yes, I think this is affected... I've just checked /etc/initramfs-tools/modules and found:19:05
IntuitiveNippleehci_hcd19:05
IntuitiveNippleuhci_hcd19:05
rtgIntuitiveNipple: well, thats what fixed it19:06
IntuitiveNippleSo, I must have thought I needed to fix it. I have dinner now but afterwards I'll change that and do some tests for you19:06
dandeleww... corrupted low memory is in the log19:06
rtgIntuitiveNipple: I'll build a test kernel with ehci built in19:06
IntuitiveNipplertg: I can do that here. I'll get back to you later when I've had time to play19:07
rtgIntuitiveNipple: I think it already is built in.19:07
rtgoh, nm19:07
IntuitiveNipplemodinfo ehci_hcd19:08
IntuitiveNipplefilename:       /lib/modules/2.6.28-9-generic/kernel/drivers/usb/host/ehci-hcd.ko19:08
rtgright, I was looking at ARM configs19:08
IntuitiveNipplegotta go - dinner cooling :D19:08
dandelapw, i got it... but compressing it takes first step19:12
dandelwhich is more useful... syslog messages or kern.log19:14
Keybukrtg: yeah the warning confuses me19:16
Keybukmaybe it's to do with how the kernel decides it's a USB 2.x hub?19:16
Keybukif you have a USB 2.x hub with only USB 1.x devices plugged in, does it look like a USB 1.x hub ?19:16
Keybukso ohci/uhci would claim it19:16
rtgKeybuk: AFAIK19:17
Keybukwhereas the ehci_hcd driver knows better?19:17
Keybukthat sounds rather shaky though19:17
Keybukand almost like they shouldn't have three drivers ;)19:17
dandelapw, want to look through em?19:17
rtgKeybuk: on the other hand, it diesn't make sense that a 1.0 device would cause uhci to load. he driver thats loaded is, after all, based on PCI ID.19:18
IntuitiveNippleright... back. Will rebuild initrd without the module load-order19:18
Keybukrtg: indeed19:18
mjg59Load order is important19:18
rtgIntuitiveNipple: mighty quick dinner :)19:18
Keybukmjg59: but *why* is it important? :)19:18
Keybukthat's what we're trying to understand19:18
mjg59If uhci loads before ehci then it'll chirp 1.0 at the device19:18
mjg59Then when ehci loads the device will already be bound19:19
Keybukmjg59: but given uhci has no device table overlap with ehci - why does uhci even try?19:19
IntuitiveNipplertg: fast eater :D19:19
mjg59Keybuk: I don't understand19:19
Keybuksurely it should dismiss the pci device because it has the wrong pci ids19:19
mjg59Keybuk: No19:19
Keybuk ehci_hcd  pci:v*d*sv*sd*bc0Csc03i20*19:19
Keybuk ohci_hcd  pci:v*d*sv*sd*bc0Csc03i10*19:19
Keybuk uhci_hcd  pci:v*d*sv*sd*bc0Csc03i00*19:19
mjg59Keybuk: ehci doesn't support USB 1 at all19:19
mjg59Keybuk: You have to have uhci or ohci fallback19:19
Keybukdoesn't support USB 1 hubs or USB 1 devices on USB 2 hubs?19:20
mjg59Things get complicated with 1 devices on 2 hus19:20
mjg59At that point I think you're supposed to speak 1 in 2 framing19:20
Keybukwhat I don't get is why, when you load uchi_hcd, it does anything to the PCI device that it doesn't claim19:20
Keybuksurely it should ignore it (just like it doesn't try and speak USB 1 to your sound card :p)19:20
mjg59Which is loading first? uhci or ehci?19:20
Keybuklet's say you have a machine19:21
Keybukit has two PCI devices19:21
Keybukone of which is a USB 2.x hub (i20)19:21
Keybukthe other is a USB 1.x hub (i00)19:21
mjg59No, don't use the word hub here19:21
Keybukwhat's the right word?19:21
mjg59host19:21
Keybukok19:21
Keybukso we have in slot 1 a USB 1.x host (i00)19:21
Keybukand in slot 2 a USB 2.x host (i20)19:21
Keybukwe'll probably load uchi_hcd first19:22
mjg59Right19:22
Keybukbecause of the ...i00* match19:22
mjg59At that point the USB ports will be powered up19:22
Keybukwhy does that module claim the device in slot 2 ?19:22
mjg59It doesn't19:22
mjg59ehci will also be loaded19:22
mjg59But19:22
mjg59When uhci binds, it'll power up the ports19:22
Keybukehci will only be loaded because there's a USB 2.x host as well19:22
mjg59The USB device at the other end will then send a chirp19:22
Keybukthe ports of which host?19:22
mjg59They're the same prots19:22
Keybukor are you saying those two hosts share a single hub?19:22
mjg59As I said, the word "hub" is not meaningful here19:23
Keybuksorry, single port (or set of ports)19:23
mjg59But yes, the two hosts share the same physical ports19:23
Keybukahhh19:23
Keybukso the two PCI devices do not map 1:1 to the ports on the back19:23
mjg59The USB device will then chirp and only get a USB 1 device19:23
Keybukthe ports on the back are shared by both PCI devices19:23
mjg59s/device/response/19:23
Keybukone of which is for a USB 1.x legacy driver and legacy OS19:23
Keybukthe other is for a USB 2.x OS ?19:23
mjg59ehci will then load, but will not reenumerate the devices19:23
mjg59Because they're already bound to uhci19:23
Keybukwhat happens if you load ehci, and you have only USB 1.x hosts?19:24
mjg59They don't work19:24
Keybukbut then if you load uhci ?19:24
Keybukdo they work then?19:24
mjg59Yes19:24
Keybukneat, thanks for the explanation19:24
Keybukso building in ehci would solve the issue ;)19:24
mjg59They'll chirp, ehci will refuse to talk to them and then uhci will reenumerate them19:24
rtgmjg59: so, it seems that the real solution is to build them both into the kernel, making sure ehci is linked first, right?19:25
Keybukrtg: ehci is first in the link order19:25
Keybukthough it's tempting to leave uhci and ohci as modules since they're "less common"19:25
mjg59Yeah, for a reason19:25
dandelapw, the log is up, however, i had to compress it with gz so it would be a manage-able download.19:25
mjg59Keybuk: Built-in USB peripherals like fingerpritn readers are often USB 119:25
Keybukmjg59: the link order doesn't pass down to userspace though, since though it's in modules.order, we never probe an alias that expands to both modules19:25
Keybukmjg59: ahh, interesting19:25
mjg59Because it's cheaper to produce19:26
rtgKeybuk: plenty of old mice that are 1.019:26
Keybukhmm19:26
Keybukit only matters for the host though?19:26
Keybukehci can talk to a 1.x mouse on a port served by 2.x host?19:26
mjg59Keybuk: No19:27
Keybukno?19:27
Keybukahh19:27
mjg59ehci can't speak USB 1 wire protocol19:27
Keybukso to support a 1.x mouse, you have to have a 1.x host?19:27
mjg59Yes19:27
Keybukgot it19:27
mjg59Or plug it into a 2.0 hub which is plugged into a 2.0 host19:27
mjg59The hub does the reframing in that case19:27
Keybukright, the 2.0 hub has a 1.0 host in it, and knows how to transmit that19:27
JanC1.1 works on a 2.0 host, 1.0 doesn't work on 2.0 host, I think?19:27
rtguhci takes over when a 1.0 peripheral is inserted (I think)19:28
Keybukhmm19:28
Keybukwhen you insert a new device19:28
Keybukif you have both ehci and uhci loaded19:28
Keybukhow do they determine which one wins?19:28
mjg59There's some magic in that case19:28
mjg59But ehci always wins19:28
Keybukand that magic doesn't work on first init unless ehci is loaded first?19:29
KeybukI guess you have to know to switch the magic on ;)19:29
mjg59I believe that once ehci has loaded, the root hub is then willing to respond to the 2.0 chirp19:29
mjg59It's handled at a level below the OS19:29
Keybukah, ok19:30
rtgKeybuk: building these modules into the kernel is essentially the same as the initramfs solution.19:30
Keybukrtg: yes, except it works when you don't have an initramfs ;)19:30
rtgcorrect19:30
mjg59The strongest argument for building ohci and uhci in is so you have a working keyboard19:31
mjg59Since they're basically always USB 1.x19:31
Keybukmjg59: yes, I was thinking about that ;)19:31
IntutiveNipple-SThe test PC has just booted... will post the dmesg to the bug report19:31
mjg59Keybuk: This is also a case where suspend/resume ordering is important - if you resume the 1.x host before the 2.0 one, all your dual-speed devices fall back to 1.x19:33
Keybukah, interesting19:34
Keybukit definitely sounds like just building these in is the right solution19:34
KeybukI can't think of any con of doing so19:34
mjg59The only reason this currently works is that ehci always come after uhci in the PCI tree, and we resume devices backwards...19:34
rtgmjg59: are there no OHCI controllers on 64 bit platforms?19:34
mjg59rtg: You can get them on PCI cards19:35
apwKeybuk, i thought someone said they needed to blacklist them for something?19:35
Keybukapw: only reason I can think of would be power?19:35
apwhow will you guarentee init order in these two?19:35
Keybukapw: kernel link order19:36
apwi think that might have been the reason given thinking on it19:36
apwthat you can guarentte it with a probe/probe sequence and not by building in19:36
apwif we can now do that then we win19:36
Keybuk?19:37
KeybukI didn't understand that19:37
mjg59apw: It's guaranteed that drivers will be intialised in the order that they're linked into the kernel19:37
mjg59As long as ehci binds first, you won't have a problem19:37
mjg59Of course, xhci will have to link before ehci19:37
apwKeybuk, i think my memory that undefined init order was a reason given to me to not build them in19:37
apwit sounds like we are relying on and assuming link order as init order so19:38
Keybukapw: the init order in the kernel is very very definitely defined ;)19:38
apwbuilding them in sounds liek its a win19:38
mdzwhat is the maximum amount of physical RAM supported by the 32-bit -generic kernel?19:38
apw4GB19:38
apwbut there is no guarentee you can put 4GB in any one machine19:38
Keybukmjg59: xhci replaces ehci though, right?  I remember reading that it's supposed to be compatible with 2.0 up19:38
apwand see it with a 32 bit kernel19:38
mjg59Keybuk: Oh, maybe19:38
apwsome machines place it elsewhere to avoid making io holes in it19:39
mjg59Keybuk: But if it's compatible you'll need to build xhci in so that ehci can't bind to it19:39
mdzapw: hmm19:39
Keybukmjg59: right, but this makes sense too19:39
Keybukotherwise you then have to remember to load x-before-h-except-after-o-unless-using-e19:39
apwit is common for 4 gb machines to place memory at 0, 1, 2 and 4 gb19:39
mjg59It's guaranteed that you *can't* get 4GB on 32-bit without PAE19:39
apwand you would only see the first three with a non-pae kernel, ie with out 32 bikernels19:39
mjg59Since PCI has to go somewhere19:40
apwmjg59, its guarenteed you won't get 100% of the 4GB yes, but some may surround the io space19:40
mdzwould it be fair to say that it does NOT support 4GB or more?19:40
mjg59Keybuk: It's starting to sound like a rap star19:40
rtgmdz: absolutely correct19:40
mdzs/fair/accurate/19:40
Keybukmjg59: MChci!19:40
apwclose enough for a release note style statement yes19:40
Keybukit would certainly be more accurate to say that "x86 supports machines of less than 4GB of memory" than "x86 supports machines of up to 4GB of memory"19:42
mjg59A PAE kernel obviously gives you support for more, limited by your chipset19:43
Keybukbut PAE kills kittens, apparently19:43
IntutiveNipple-SKeybuk: can we blacklist a driver in initrd ?19:43
rtgmjg59: presumably, on those machines you couldn't stuff more then 4GB19:43
KeybukIntutiveNipple-S: yes, same was as you do after - just run update-initramfs19:43
mjg59rtg: 945 only has 32 bits of data lines on the memory controller, for instance19:44
IntutiveNipple-SKeybuk, so it's enough to add to /etc/modprobe.d/blacklist ?19:44
Keybukrtg: the problem aiui is that once you enable PAE, you cease supporting chipsets without it19:44
mjg59But even with PAE, you can't get the full 4GB19:44
mjg59Keybuk: Chips, not chipsets19:44
KeybukIntutiveNipple-S: and run update-initramfs, yes19:44
IntutiveNipple-SKeybuk, OK, going to add ehci_hcd to the blacklist to see if the physical external ports are shared by the controllers19:44
mjg59You can run a PAE kernel on a core 2 with a 945, even though it's only got 4GB of address space19:44
rtgKeybuk: I'm thinking abuot a new PAE flavour for Karmic, and drop 32 bit server19:45
Keybukrtg: I'd make PAE the default and have a -crusty flavour - but that's me, always forward never looking back at the people crying in my wake ;)19:45
rtgcan't make it the default. too many non-PAE CPUs out there. VIA for example.19:45
IntutiveNipple-Swe need a way to 'stop' external HDDs on USB ... when it is unplugged I can hear the emergency retract19:49
KeybukIntutiveNipple-S: eject them19:50
Keybukeject /dev/sdN19:50
IntutiveNipple-SI tried that ... didn't help. Will re-investigate when I get time19:50
IntutiveNipple-SOK, the other machine has decided to do an fsck... twiddles thumbs19:51
rtgIntutiveNipple-S: its time for ext4 :)19:51
IntutiveNipple-Srtg, most of the volumes on it are ext4... I suspect this one might have remained ext3 for backwards compatibility with Hardy though19:52
IntutiveNipple-SThere's 11 volumes mounted on it, that helps cut down on boot delays from one big fsck19:52
rtgIntutiveNipple-S: sounds like a hell of  a big server19:53
IntutiveNipple-Slaptop19:53
IntutiveNipple-SI just split all the logical work areas into separate LVMs19:54
rtgor, just a big drive with lots 'o volumes19:54
IntutiveNipple-Sso, /home then /home/all then /home/all/Project /home/all/SourceCode /home/tj/ and so forth19:54
IntutiveNipple-Smusic, videos, etc., on separate LVMs too#19:55
IntutiveNipple-Sright the laptop has started. Confirmed ehci_hcd isn't loaded19:55
rtgI just built a server with 4 500Gb disks using raid 0 and ext4. Its significantly faster then a single spindle.19:56
Keybukheh19:57
IntutiveNipple-SYeah, I've got a dell with PERC and RAID-5 in the garage, and a soft md raid-5 with 5 disks in another dev machine. makes a hell of a difference19:58
mjg59It also turns out that fsck is much faster if you can run as many threads as you have spindles19:58
IntutiveNipple-SUSB1 transfer is happening via port 119:59
IntutiveNipple-Sjust need to check which host is in use19:59
Keybukmjg59: something we're vaguely working on now that fsck is moving into util-linux-ng is to have a mount daemon in there that handles the fsck and mount stuff based on either udev or dbus20:00
mjg59There's threaded patches for ext3 fsck. And probably ext4.20:01
Keybukindeed20:01
Keybukthough then you need a flag point where you know that /dev is full20:01
mjg59And SGI implemented it for xfs at some point (their Final Solution)20:02
IntutiveNipple-SIs there an easy way with /sys/ to show the USB host and target path to show which device is connected via which port on which host?20:10
IntutiveNipple-SI've been digging but can't find a clear way to show the linkage20:10
Keybuk$ for DEVICE in /sys/bus/usb/devices/*; do readlink $DEVICE; done20:17
Keybukthe PCI id part of the path will change20:18
IntutiveNipple-Sbeautiful, thanks. I was trying something similar but starting from /sys/block/ (to show the disk-to-usb relationship)20:18
IntutiveNipple-SIf i can get something similar from the block/ path it will make it easy to show which host/port the device in port 1 is on for all permutations of the drivers20:19
IntutiveNipple-SI've got "readlink /sys/block/sdb/device" == "../../../7:0:0:0" but not been able to get the clear path shown like your code does20:20
Keybukjust do a readlink on /sys/block/sdX itself20:20
Keybuk$ for DEVICE in /sys/block/*; do readlink $DEVICE; done20:20
IntutiveNipple-Sgot it! I'd gone one directory too far :)20:24
IntutiveNipple-Sreadlink /sys/block/sdb20:24
Keybuk;)20:24
IntutiveNipple-SI was working inside the sdb/ directory off the device link20:24
IntutiveNipple-Sended up with lots of relative stuff, not realising the sdb/ has symlinked me someplace else in /sys/ already20:25
Keybukthat works too20:25
Keybukit'll just take you higher up in the same tree20:25
IntutiveNipple-Sthis is what i wanted... one line shows the linkage now20:25
Keybuk/sys/block/sda will point at something like /sys/devices/BUS/DEVICE/SCSI HOST/SCSI TARGET/SCSI ID/block/sda20:25
Keybuk/sys/block/sda/device will point at the physical device20:26
Keybukwhich is obviously the SCSI ID20:26
Keybuk/sys/block/sda will point at something like /sys/devices/BUS/DEVICE/SCSI HOST/SCSI TARGET/SCSI ID20:26
Keybukerr20:26
Keybuk/sys/block/sda/device will point at something like /sys/devices/BUS/DEVICE/SCSI HOST/SCSI TARGET/SCSI ID20:26
Keybukthe device symlinks are entirely needless these days20:26
Keybuksince if they point at anything other than an ancestor in the tree (.. or ../.. usually) they're bogus20:26
IntutiveNipple-SI know what we could do with. A patch to ehci_hcd reporting the number of ports it is handling, like uhci_hcd reports20:34
IntutiveNipple-Slol... been slapping the bluetooth mouse because it stopped working... then realised I'd just rmmod uhci_hcd... and the BT module is on an internal USB1 port :D#20:36
IntutiveNipple-Smatters just got worse! when using the touchpad, the mouse cursor gets stuck on X screen 1 if it ventures there, which it now has, but another bug in Xserver means all apps started from screen 1 end up on screen 0 so got no terminal to reload uhch_hcd  ... going home!20:38
IntutiveNipple-Sssh saves the day :)20:39
ernstpI keep getting timeout exceptions from the ata subsystem in Jaunty, never happened on Intrepid21:13
ivokshas anyone reported problems with latest hardy kernel on sparc machines?21:13
ernstpcould it be a kernel bug or will everyone blame my hardware?21:13
ernstphttp://paste.ubuntu.com/132196/21:14
ernstphappens with different bios settings, different sata ports21:15
ivoksdid you check out smart?21:15
ernstponly my ext4 root filesystem during big dist-upgrades, but that's probably because it's such a heavy load21:15
ernstpivoks: no errors ever21:19
IntuitiveNippleernstp: that doesn't look great. Can you post a bug report and attach the system's /var/log/dmesg and /var/log/kern.log containing the error messages?21:46
ernstpIntuitiveNipple: yeah, I decided to give a bugreport a shot21:46
ernstpIntuitiveNipple: only got dmesg though, bit tricky with the logs with the read-only filesystem :-(21:47
IntuitiveNipplehmmm, that bad? how about using netconsole or a serial console to capture the error log?21:48
ernstpI have other filesystems so I have done dmesg > /file21:49
ernstpso I've got that21:49
IntuitiveNipplesometimes dmesg doesn't contain the error reports... but if it does, then we don't need kern.log21:50
ernstphttps://bugs.launchpad.net/ubuntu/+source/linux/+bug/34391921:52
ubot3Malone bug 343919 in linux "[jaunty] ata timeout exception with data loss" [Undecided,New] 21:52
ernstpright :-)21:53
IntuitiveNippleDoes it only affect the one drive? The Samsung SP2504C ?21:58
ernstpIntuitiveNipple: yeah, and only that filesystem22:02
IntuitiveNippleDo you have smartd installed?22:03
IntuitiveNippleas in smartmontools22:03
ernstpI did a smartctl --all on the disk, no errors ever22:04
ernstpIntuitiveNipple: but I'll install that22:05
ernstpIntuitiveNipple: oh, I did have it installed22:06
IntuitiveNippleyeah, I just saw an article about it causing the issue, but that was some time ago now22:07
ernstpIntuitiveNipple: yikes, it can do that kind of stuff... ?22:09
ernstpIntuitiveNipple: well I'll uninstall it then, see if it happens again22:09
IntuitiveNipplecan you paste this report to the bug? lspci -nn22:14
ernstpIntuitiveNipple: write ernstp: so I see when you write ;-)22:16
ernstpIntuitiveNipple: http://paste.ubuntu.com/132216/22:17
ernstpIntuitiveNipple: gonna try running with the mainline kernel 2.6.28.7 for a while22:23
ernstpIntuitiveNipple: see if it still happens22:23
IntuitiveNippleDon't worry, it will :)22:25
IntuitiveNippleStick with the Ubuntu kernel so we can debug it.22:25
ernstpIntuitiveNipple: kk22:28
ernstpIntuitiveNipple: this looks similar.. https://bugs.launchpad.net/ubuntu/+bug/31557222:29
ubot3Malone bug 315572 in ubuntu "alert! /dev/disk/by-uuid/be80cf42-e6f2-466c-bb73-7d664956a334 does not exist" [Undecided,New] 22:29
ernstpok, gotta sleep now, cya22:30

Generated by irclog2html.py 2.7 by Marius Gedminas - find it at mg.pov.lt!