/srv/irclogs.ubuntu.com/2017/03/15/#maas.txt

mupBug #1672947 opened: MAAS accepts 0.0.0.0/0 as a subnet, but this breaks DNS update code <MAAS:New> <https://launchpad.net/bugs/1672947>05:01
ybaumycan i use a apt proxy for one subnet and for another not?05:27
ybaumyor is this a global parameter05:28
=== frankban|afk is now known as frankban
mupBug # changed: 1254807, 1386504, 1441408, 1549397, 1571031, 1582323, 1598175, 1600328, 1602412, 1606508, 1611999, 1614584, 1620478, 1621507, 1628514, 1629982, 1630361, 1630636, 1632853, 1633378, 1633397, 1633401, 1633452, 1633457, 1633462, 1633467, 1633470, 1633600, 1633717, 1633822, 1636250,09:16
mup1636251, 1636324, 1636873, 1636874, 1636992, 1637009, 1637182, 1637192, 1637246, 1637401, 1638284, 1638285, 1638288, 1638575, 1638589, 1639182, 1639247, 1639258, 1639288, 1640147, 1640259, 1640780, 1642033, 1642200, 1643552, 1645067, 1645319, 1645912, 1646162, 1646163, 1646748, 1646891, 1646955,09:16
mup1647703, 1648456, 1648836, 1651452, 1651675, 1655049, 1656208, 1656717, 1657491, 1659152, 1659164, 1659244, 1659511, 1659607, 1659672, 1659959, 1660185, 1660188, 1660858, 1660863, 1660864, 1661214, 1661579, 1663276, 1663290, 1663517, 1663643, 1663686, 1664285, 1664664, 1664667, 1664732, 1664813,09:16
mup1665143, 1665459, 1665478, 1665839, 1667426, 1667754, 1668731, 1668759, 1669213, 1669221, 1669225, 1669226, 1669246, 1669425, 1669428, 1669547, 1669568, 1669570, 1669783, 1669833, 1670326, 1670337, 167082109:16
mupBug #1671897 changed: ui to browse combos of tags is inconsistent with juju's notion of combos of tags <ui> <uosci> <MAAS:Invalid> <https://launchpad.net/bugs/1671897>09:46
mupBug #1671897 opened: ui to browse combos of tags is inconsistent with juju's notion of combos of tags <ui> <uosci> <MAAS:Invalid> <https://launchpad.net/bugs/1671897>09:49
mupBug #1671897 changed: ui to browse combos of tags is inconsistent with juju's notion of combos of tags <ui> <uosci> <MAAS:Invalid> <https://launchpad.net/bugs/1671897>09:52
mupBug #1665482 changed: [2.2] MAAS shows install.log from previous deploy on a machine that failed to commission <MAAS:Fix Released> <https://launchpad.net/bugs/1665482>11:37
rainmakerHi all, is there any way to force maas to use eth1 to probe the BMC of the server? I'm having trouble turning on servers with 2 nics connected since the bmc shares port 113:41
wargamezHi. Is maas 2.0 supported by landscape?14:28
roaksoaxrainmaker: i don't fully understand what the issue is there, but you would only need the rack controller to be able to reach the IP address of the BMC, or one on the same subnet14:29
cnfin maas, what is the difference between a fabric and a space?14:34
roaksoaxwargamez: for autopilot? I think they are working on it14:39
roaksoaxcnf: fabric is a switch or a set of switches14:39
cnfhmm14:39
roaksoaxcnf: in 2.2+ space concept is being changed from L3 to L2. Basically, a space tells that a vlan or a set of vlans can communicate to each other14:39
cnfroaksoax: because the docs say "A fabric is a set of interconnected VLANs that are capable of mutual communication. "14:39
roaksoaxpmatulis: ^^14:40
cnfso i configured my fabrics as if they were spaces14:40
cnfroaksoax: so i can have 10 vlans in 8 spaces  on one fabric, really?14:41
cnfso why would you define different fabrics on maas?14:41
cnfhere, it's all just one big "virtual switch"14:41
roaksoaxcnf: spaces is not mandatory in 2.2+14:42
cnfwell, i was asking in juju how to have juju place things on the right server, because not every server has / should have an ip in every vlan14:43
cnfand i got pointed to spaces14:43
cnf(my main goal is deploying openstack with juju btw)14:43
pmatulisroaksoax, so you want "A fabric is a switch or a set of switches." ?14:43
roaksoaxpmatulis: we can clarify the terms later, but just pointing out that the term in ther is actually ckinda referring to spaces14:43
mupBug #1673087 opened: Save/Load Network and Storage Configurations <MAAS:Triaged> <https://launchpad.net/bugs/1673087>14:44
mupBug #1673091 opened: Tags with dots are not saved <error-surface> <MAAS:Triaged> <https://launchpad.net/bugs/1673091>14:44
cnfok, so i should move everything back into fabric-0 then14:44
cnfi think14:44
pmatulisroaksoax, comes from here: http://bazaar.launchpad.net/~maas-committers/maas/2.1/view/head:/docs/networking.rst14:49
wargamezroaksoax: Yes for autopilot. I am not able to connect to maas 2.X with it; it says 401 gone. Is there a maas 1.9 for ubuntu 16.04 available somewhere?14:50
pmatulisroaksoax, please open a doc bug on it with specifics14:50
cnfit has the same info text in the GUI btw14:51
roaksoaxwargamez: i dont think it is released, but they have been working on it14:51
cnfhmm, what a mess :P14:52
roaksoaxcnf: note that fabrics / spaces is a design thing. If all your machines are in the same fabric, and all your machines are connected to the same 'untagged' vlan (i.e. all pxe boot on the same untagged vlan), then yes, it sounds like all your machines should be in the same fabric14:52
cnfroaksoax: i have a lot more vlans than that14:53
roaksoaxcnf: if you have, say, X machines that pxe boot on untagged vlanX and Y machines that pxe boot on untagged vlanY, then it sounds like you need 2 fabrics14:53
cnfbut i have 1 vlan just for MAAS14:53
roaksoaxcnf: yeah just an example :)14:53
cnfbut here, everything is one switch14:53
cnfwell, about 5 switches, but they behave as 114:53
cnfignoring the vmware distributed vswitches etc14:54
cnfhmm14:54
cnfwhat are the consequences of having different fabrics?14:54
cnf(seems i can't move VLAN's between fabrics)14:55
cnfso you would have a fabric per top of rack switch, for example, but all pxe boot vlan's would be in one space?14:57
cnfor am I misunderstanding this?14:57
roaksoaxcnf: i sec, otp14:58
cnfsure14:59
roaksoaxcnf: ok, sorry about that.15:03
roaksoaxcnf: say you have this:15:03
roaksoaxMAAS (region/rack) -- switch1 -- node0115:03
roaksoax                   -- switch2 -- node0215:03
roaksoax                   -- switch3 -- node0315:03
roaksoaxcnf: and node01/node02/node03 can talk to each other in the same vlan15:04
roaksoaxcnf: and PXE boot from MAAS in the same vlan, from the same subnet15:04
roaksoaxcnf: then that would be 1 fabric15:04
roaksoaxcnf: i guess the right term would be that all those 3 switches are trunked15:05
cnfright15:06
cnfi'm not sure when to use different fabrics, i guess15:07
cnfas opposed to using spaces15:07
cnfroaksoax: so if you had a VLAN for pxe booting and maas mgmt, and one for storage traffic, would those be different fabrics?15:07
cnfwhen they are on the same switch15:07
roaksoaxcnf:15:08
roaksoaxMAAS eth0-10.10.10.2 -- switch1.fabric0.untagged -- node01.eth0    -- 10.10.10.1015:08
roaksoax                                fabric0.vlan10   -- node01.eth0.10 -- 192.168.10.2015:08
roaksoax                     -- switch2.fabric0.untagged -- node0215:08
roaksoax                     -- switch3.fabric0.untagged -- node03.eth0    -- 10.10.10.1115:08
roaksoax                                fabric0.vlan10   -- node03.eth0.10 -- 192.168.10.2115:08
cnfok15:08
roaksoaxMAAS eth0-10.10.10.2 -- switch1.fabric0.untagged -- node01.eth0    -- 10.10.10.10   -- space.undefined15:13
roaksoax                                fabric0.vlan10   -- node01.eth0.10 -- 192.168.10.20 -- space.test15:13
roaksoax                     -- switch2.fabric0.untagged -- node0215:13
roaksoax                                fabric0.vlan20   -- node02.eth0.20 -- 192.168.20.21 -- space.storage15:13
roaksoax                     -- switch3.fabric0.untagged -- node03.eth0    -- 10.10.10.11   -- space.undefined15:13
roaksoax                                fabric0.vlan10   -- node03.eth0.10 -- 192.168.10.21 -- space.test15:13
roaksoax                                fabric0.vlan30   -- node03.eth0.30 -- 192.168.30.21 -- space.storage15:13
roaksoaxcnf: or the above too15:13
roaksoaxcnf: so in maas 2.2+ L2 spaces, you can have vlan20 and vlan30 in the same space, each with different subnets, and you are saying basically that machines in vlan30 and machines in vlan20 can communicate with each other15:14
roaksoaxcnf: on those subnets via those spaces15:14
cnfok15:15
cnfso do i need separate spaces for ipv4 or ipv6 ?15:15
roaksoaxcnf: not necessarily, you can have both ipv4/ipv6 subnets in the same vlan15:16
cnfbut in the same space?15:16
cnfbecause technically, they can't communicate :P15:16
cnfhmm, but spaces are per vlan, of course15:17
roaksoaxyeah, so you can have ipv4/ipv6 on vlan10 and ipv4.1/ipv6.1 in vlan20, both in the same space15:17
roaksoaxwhat you are saying there is, ipv4 on vlan10 can communicate with ipv4.1 in vlan2015:17
cnfok15:17
roaksoaxthe same for ipv615:17
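The VLAN-to-space assignment roaksoax describes for MAAS 2.2+ can be sketched in a few lines of Python. This is only an illustrative model, not MAAS code: the `Vlan` class and the `same_space` helper are hypothetical names, VID 0 is used here as a stand-in for the untagged VLAN, and the space names are taken from the diagram above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Vlan:
    """Hypothetical model of a VLAN as discussed above (not a MAAS class)."""
    fabric: str            # e.g. "fabric0"
    vid: int               # 0 stands for the untagged VLAN in this sketch
    space: Optional[str]   # None models "space.undefined"


def same_space(a: Vlan, b: Vlan) -> bool:
    """True when both VLANs are assigned to the same defined space."""
    return a.space is not None and a.space == b.space


# Values mirror roaksoax's second diagram.
untagged = Vlan("fabric0", 0, None)
vlan10 = Vlan("fabric0", 10, "test")
vlan20 = Vlan("fabric0", 20, "storage")
vlan30 = Vlan("fabric0", 30, "storage")

print(same_space(vlan20, vlan30))      # True: both are in space "storage"
print(same_space(vlan10, vlan20))      # False: "test" vs "storage"
print(same_space(untagged, untagged))  # False: no space is defined
```

In this model a shared space is a statement by the operator that the VLANs can reach each other; nothing here verifies actual routing.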
cnfso when would you use different fabrics?15:17
roaksoaxcnf: when you have infrastructure that you don't want to communicate with each other15:19
roaksoaxcnf: for example, you can have 2 different openstack clouds, each on their own fabric15:19
cnfhmm, ok15:19
roaksoaxcnf: the instances could communicate with each other because they are "public" addresses15:19
cnfand have the same vlans with the same subnets, but there is no link between the 2?15:19
cnfright15:20
cnfok, i'm going to have to jiggle some things around15:22
cnfroaksoax: thanks15:22
roaksoaxcnf: this may help a bit too: https://docs.ubuntu.com/maas/devel/en/release-notes#important-announcements_115:23
cnfhmm15:49
ThiagoCMCHey guys, does the PXE boot stuff work on top of tagged vlans? Like: "eth0.100"? While just eth0 is the regular "ubuntu maas" IP, for accessing its UI, ssh into it, etc...15:57
ThiagoCMCThe server was PXE booted by MaaS, but the commissioning is failing: "Could not query power state: Connection timed out while performing power action.  Check BMC configuration and connectivity and try again.."15:59
ThiagoCMCnot sure what to do...   =/15:59
ThiagoCMCI changed the "power user / pass", to Dell's default but, still doesn't work... I can use those same user/pass on iDrac.15:59
rainmakeranyone here installed openstack? how did you have success? using autopilot or juju deploy openstack-base?16:00
roaksoaxrainmaker: last time i did was with conjure-up and maas 2.216:01
roaksoaxThiagoCMC: if the bios handles it, it should yes16:01
rainmakerroaksoax: i used conjure-up for deploying it on a single laptop.. but will it still work for a production environment?16:02
ThiagoCMCHmm... Double checking BIOS settings... Thanks!16:03
ThiagoCMC=)16:03
mupBug #1673135 opened: [2.2b3] Machine fails to deploy , but install log is not immediately stored. <MAAS:Triaged> <https://launchpad.net/bugs/1673135>16:11
cnfhmz16:13
cnfwhat a mess :P16:13
ThiagoCMCroaksoax, "iDRAC -> iDRAC Settings -> Network/Security -> Network -> IPMI Settings [x]Enable IPMI Over LAN" - Worked! Thanks man!16:15
cnfroaksoax: i can't pick what vlan to assign in a fabric on a node16:24
cnfit's grayed out, and only the 1st one is selected?16:24
zeestratrainmaker: I'd look at building your own bundle either from the base examples from https://github.com/openstack-charmers/openstack-bundles or something like this HA example: https://launchpadlibrarian.net/298175262/bundle.yaml16:26
zeestratrainmaker: My experience is that openstack is just not something that is going to work out of the box so we needed to take a moment to look at all the components.16:28
Budgie^Smorezeestrat, I would say that Canonical has made it pretty easy to get a basic clean openstack cluster to work out of the box, my biggest issue with it was (and also my weakness) setting up the servers networking correctly to handle all the different VLANs16:30
zeestratBudgie^Smore: Oh, absolutely. My recommendation is just to take a moment and think about the normal things such as storage and network as the latter usually never works automagically.16:34
cnfhmm16:34
cnfi seem to have painted myself into a corner16:35
cnf"Can't delete fabric; the following interfaces are still connected: eth0 (unknown) on <unknown-node>, eth0 (unknown) on <unknown-node>"16:35
cnfi don't seem to have said nodes...16:35
cnfhmz, how the hell do i clean this up16:41
cnfpostgres stuff it is :/16:44
cnfhmm, so my database is inconsistent16:49
cnfalready?16:49
cnfhow the hell did that happen :/16:49
cnfanyone proficient enough with maas to help me clean this up?16:51
roaksoaxmpontillo: ^^ :)16:52
Budgie^Smorezeestrat oh yeah, you definitely have to get your network layer setup correctly for things to go smoothly. http://blog.naydenov.net/2015/11/deploying-openstack-on-maas-1-9-with-juju-network-setup/ is a walk through that I used when I deployed my first openstack cluster16:57
cnfhttps://bpaste.net/show/b761a057d1c716:59
cnfthat looks wrong...16:59
cnfno node_id16:59
cnfand why can't i pick the vlan on an interface?17:00
cnfhttps://www.dropbox.com/s/m971jqnq8mve3be/Screenshot%202017-03-15%2018.01.03.png?dl=017:01
cnfshould the VLAN field be grayed out?17:01
cnfhmz17:04
Budgie^Smorecnf I think that VLAN is associated with the fabric so it would make sense to grey it out17:04
cnfBudgie^Smore: there are 5 vlans associated with that fabric17:04
cnfand the one selected is NOT the one i want17:05
cnfhmz, i don't understand this ^^;17:06
Budgie^Smoredoes maas allow multiple subnets / vlan?17:09
mpontillocnf: let me take a step back: what specific version of MAAS are you using? (latest in the 2.2 or 2.2-beta series, I imagine?)17:10
mpontilloBudgie^Smore: yes, you can have multiple subnets in a VLAN17:10
cnfmpontillo: whatever was default 3 weeks ago for "apt install maas" on 16.04 :P17:10
cnfuhm,17:10
cnfMAAS Version 2.1.3+bzr5573-0ubuntu1 (16.04.1)17:10
mpontillocnf: ok, thanks. the following query might help us understand why you have stray interfaces on your fabric. https://gist.github.com/mpontillo/94b227942fbcfc79dcad5124927ca9d917:12
mpontillocnf: use "sudo maas-region dbshell" to get a postgres console, then do "\pset pager off"17:12
cnfi'm already on17:12
cnflet me sanitize some ip addresses17:14
mpontillocnf: another useful query is "select * from maas_support__node_networking;" (that's a view) -- but it's more node-centric, and it sounds like unknown interfaces are the issue for you, which are created in certain cases when we don't have a specific node17:14
mpontillocnf: my guess is they're from leftover DHCP leases that MAAS was notified about, and the fact that you can't delete the fabric because of them is a bug17:15
cnfmpontillo: https://bpaste.net/show/8c98a8f2239417:15
cnfthe last query doesn't have any reference to the floating interfaces17:16
cnfmpontillo: so i decided to keep the fabric in question., but i still think it's nasty to have this problem17:21
cnfany suggestions on how to clean it up?17:21
mpontillocnf: I was just about to give you a workaround to clean up all the unknown interfaces so you could delete the fabric. https://gist.github.com/mpontillo/1a6faee09fe674c62dbaab27eb9164f417:21
cnfnice17:22
mpontillocnf: though if that is fabric-0 I'm honestly not sure if we'll let you delete it; that might be the "default fabric", let me know ;-)17:23
cnfthat seems to have done it, i think17:23
mpontillocnf: ok cool17:23
cnfit wasn't 017:23
cnfso17:23
cnfuhm17:23
cnfmpontillo: what happened is i misunderstood fabrics from the docs17:23
cnfand configured them as spaces17:23
cnfthen later learned i should have used spaces for this, so i made the right spaces17:24
cnfand tried to put everything on fabric-0 again17:24
cnfwhich is when i ran into this17:24
mpontillocnf: yeah, we think of fabrics as basically an interconnected set of (non-virtual) switches; traditional switches in which you might use 802.1q VLANs on -- it sounds like you have vswitches in your environment though?17:24
cnfthen i learned that a "fabric" is what is on a "cable" really17:24
cnfmpontillo: yeah17:24
cnfit's a juniper qfabric17:25
cnfwith most things on a 2 x 10g LAG17:25
cnfbut i never did figure out how to pxe boot from a LAG interface17:25
cnfso the maas network is on a separate copper cable, from the same qfabric, really17:25
mpontillocnf: OK, I'd like to better understand your environment; if you don't mind me asking, are you using MAAS on physical or virtual hardware? is this a production or demo/staging type environment?17:25
cnfthis is a PoC atm, to evaluate using MaaS / juju to install / manage openstack17:26
cnfthe MAAS controller is on a VM, the rest are physical machines17:26
cnf(PoC is Proof of Concept if you were not familiar with the TLA)17:26
mpontilloOK, sounds good, yes I know that one ;-)17:27
cnfso _normally_ we have 1 copper network (100mbit) for iLO / IPMI access17:27
cnfand everything else on optical LAGs17:27
cnfbut as said, i never found out how to pxe boot on the LAG, so i added a copper network for the MAAS network17:28
cnfit's the same virtual switch, but a separate connection to the machines17:28
mpontillocnf: all right. so you may have two or three VLANs, but it may look like two or three fabrics to MAAS?17:29
mpontillo(it's fine to model it as three fabrics, btw, that will probably mean less headache if you aren't using VLAN tags anywhere)17:29
cnfso what i am learning, i think? is that a fabric isn't a switch, but really a "cable" connected to the machine17:29
mpontillocnf: we use a fabric to model a traditional switching infra with consistent VLAN tags inside. in your case, if you have three completely separate VLANs that can in no way communicate even by fiddling with the VLAN tags, then yeah, telling MAAS it's three fabrics will probably give you the best experience17:30
mpontillocnf: if you have a switch with trunk ports configured where each host interface can retag traffic on the different VLAN IDs, then you'd want to use a fabric with multiple VLANs inside. but it doesn't seem like that's true in your env17:31
cnfwell, it is on the fibers17:32
cnfthe LAG is a trunk17:32
mpontillocnf: okay, then I would model the LAG's VLANs if you want the deployed nodes to be able to configure VLAN interfaces on them17:32
cnfso atm i have fabric-0 with all the VLAN's, a fabric-maas with the maas vlan, and a fabric-mgmt which carries the iLO / IPMI traffic17:33
mpontillocnf: sounds good to me.17:33
cnffor the maas controller, this looks like a separate interface17:33
cnfthe ipmi one, that is17:33
mpontillocnf: that should be fine17:33
cnfcool17:34
cnfso, as I understand it17:34
cnfjuju can't configure networking17:34
cnfi need to configure what i want on a machine in maas, and then juju gets to use what is there, right?17:34
mpontillocnf: yes, that makes sense. so for juju what you might do first is define three spaces to start with: 'mgmt', 'maas', and 'aggregate' - since you're on MAAS 2.1, those would be assigned to each subnet MAAS knows about in those spaces17:37
cnfright17:38
cnfi also added spaces for public, openstack-mgmt and openstack-storage17:38
cnfwhich are all in fabric-017:39
mpontillocnf: are you able to define 802.1q VLANs (based on VID) inside the LAG fabric? if so then you could define the VLANs and subnets in the 'aggregate' network you want to use with juju, and define spaces appropriate for those17:39
cnfright17:39
cnfmind if i paste something in pm?17:39
mpontillocnf: ok. in MAAS 2.1, spaces must be assigned to subnets, so if you've simply created the spaces they aren't really in use until you tell MAAS which subnets they're associated with (as mentioned earlier, in MAAS 2.2 they are migrating to VLANs)17:40
mpontillocnf: sure, if you have sensitive pastes you'd rather not make public, feel free to PM them17:40
cnfit's public ip ranges17:40
cnfi'd rather not broadcast those :P17:40
mpontillonp17:41
cnfdoes that look sane?17:41
mpontillocnf: yeah that looks good to me.17:41
cnfcool17:42
* mpontillo likes the "Available IPs: 100%" next to your /64; I don't imagine that will change much over time ;-)17:42
cnf:P17:48
cnfmpontillo: ok, thanks for your help18:05
cnfi'm calling it a day18:05
ThiagoCMCHey guys, how to configure MaaS in a way that it is not the gateway of the bare-metal servers?18:05
cnf19:00 here, i'm getting hungry and a bit sleepy18:05
ThiagoCMCI tried to delete the gateway on DHCP config but, it reappears...18:05
cnfThiagoCMC: each subnet lets you define a gateway, and custom routes18:06
ThiagoCMCI want the PXE network to have no gateway, and another interface of the bare-metal boxes will be the gateway...18:06
ThiagoCMCHmm...18:06
ThiagoCMCwell, I just don't want the maas as a gateway, while keeping it as metadata / cloud-init as usual.18:06
mpontilloThiagoCMC: you can set the gateway per subnet18:07
ThiagoCMCHow the bare-metal boxes will have 2 gateways without using iproute2 ?18:07
ThiagoCMCOr, does it use iproute2 multiple tables?18:07
ThiagoCMCfor each gateway?18:07
ThiagoCMCOr just metrics?18:07
mpontilloThiagoCMC: well, iproute2 is used, but it's true that MAAS could handle overlapping default routes better. you can define static routes18:08
mpontilloor rather, overlapping routes in general. we've heard requests for policy based routing18:08
mpontilloThiagoCMC: so let me see if I understand correctly. when you PXE boot you want to use the PXE network's default router for the gateway (such as to reach the Ubuntu archive, etc.) but you have another interface you want to use for just about all other traffic (assuming it's online)?18:10
mpontilloThiagoCMC: when you say "I want the PXE network to have no gateway"... that's okay, but I see how that could be problematic with MAAS since during enlistment, commissioning, and deployment we won't bring up your data-plane interfaces18:11
roaksoaxyou can already achieve that18:19
roaksoaxyou can't have two default gateways for a machine18:19
ThiagoCMCI know, with iproute2, you can...   =)18:20
roaksoaxif you have a pxe network with the default gateway that is different from having gateway for other routes18:20
roaksoaxor viceversa18:20
mupBug #1673204 opened: LXD not getting IP address, MAAS 2.2b3 throwing django.db.utils.IntegrityError and  AssertionError <MAAS:New> <https://launchpad.net/bugs/1673204>18:20
ThiagoCMCmpontillo, oh, I see how that enlistment / deployment can be problematic...18:21
roaksoaxsounds like you need to define your default gateway first and then configure routes for other places18:21
=== frankban is now known as frankban|afk
zehoI'm trying to figure out why i'm unable to deploy autopilot from my maas server with juju. I'm looking at the commands.log file and ls18:48
zehoand I can't tell what is wrong. What I do see is the following:18:49
zeho[ERROR: 03-15 13:19:34, gui.py:270] Problem with juju bootstrap.18:49
zehoTraceback (most recent call last):18:49
zeho  File "/usr/lib/python3.4/concurrent/futures/thread.py", line 54, in run18:49
zeho    result = self.fn(*self.args, **self.kwargs)18:49
zeho  File "/usr/share/openstack/cloudinstall/controllers/install/multi.py", line 146, in do_install18:49
zeho    raise Exception("Problem with juju bootstrap.")18:49
zehoException: Problem with juju bootstrap.18:49
zeho[DEBUG: 03-15 13:19:34, error.py:35] showing error view for: Problem with juju bootstrap.18:49
zeho[ERROR: 03-15 13:19:34, task.py:71] ran off end of task list, can't start Bootstrapping Juju18:49
zeho[DEBUG: 03-15 13:19:34, utils.py:627] ssh keys exist for this user, they will be used instead.18:49
zeho[DEBUG: 03-15 13:19:34, multi.py:139] Bootstrapping Juju: JUJU_HOME=/home/maasadmin/.cloud-install/juju juju  bootstrap  --to autopilot18:49
zeho[DEBUG: 03-15 13:19:35, multi.py:145] Problem during bootstrap: '{'err': 'WARNING ignoring environments.yaml: using bootstrap config in file "/home/maasadmin/.cloud-install/juju/environments/maas.jenv"\nWARNING This juju environment is already bootstrapped. If you want to start a new Juju\nenvironment, first run juju destroy-environment to clean up, or switch to an\nalternative environment.\nERROR environment is already bootstrapped\n', 'output': '', 'status': 1}'18:49
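The last DEBUG entry in zeho's paste already names the cause: the juju environment was bootstrapped before, so the installer's second bootstrap attempt fails. A minimal Python sketch of spotting that case in a result dictionary shaped like the one logged might look like this. The function name `already_bootstrapped` is hypothetical, and the sample dictionary is a shortened copy of the logged one, not real installer output.

```python
def already_bootstrapped(result: dict) -> bool:
    """Return True if a failed bootstrap result reports an existing environment.

    `result` is assumed to have the keys seen in the log above:
    'err' (stderr text), 'output' (stdout text) and 'status' (exit code).
    """
    failed = result.get("status", 0) != 0
    return failed and "already bootstrapped" in result.get("err", "")


# Shortened version of the dictionary from the log, for illustration only.
sample = {
    "err": "WARNING This juju environment is already bootstrapped.\n"
           "ERROR environment is already bootstrapped\n",
    "output": "",
    "status": 1,
}

if already_bootstrapped(sample):
    # The logged warning itself suggests `juju destroy-environment` as the cleanup step.
    print("environment exists; clean it up before bootstrapping again")
```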
firlanyone on that might be able to help me understand the best way to integrate mellanox into a node for maas 2.1?18:58
mpontillofirl: what do you mean by "integrate mellanox"?20:08
firlwhen I commission a host that I can see an ip over ib in the networking20:09
firlmpontillo: I know a year ago it wasn’t at a place to be able to do it. I wasn’t sure if things have changed20:10
mpontillofirl: sorry, I may not be up to date on the terminology for this, but what is "ip over ib"? for MAAS 2.2 I worked on some switch ASIC identification code that runs during commissioning; we have the ability to deploy onto switches, but it can get complex depending on what you want to do20:11
firlsure, infiniband = ib. The drivers that require installation are 3rd party for them to be recognized during commissioning.20:12
firlso the IPoIB is just ethernet over infiniband ( mellanox )20:12
wililupyfirl: can you give me some info on your setup?20:14
mpontillofirl: ah, I see what you mean now. mellanox being a vendor that makes lots of different things it didn't click that you were talking about InfiniBand, sorry. MAAS currently only models Ethernet interfaces, so IB cannot be fully supported, unless you can PXE boot from a non-IB interface and run a script post-deployment to configure IB20:14
firlI can do that20:14
firlis there a way to do a post-deployment script during commissioning?20:14
firlevery host has, 1 pxe net, 1 ipmi net, a 20gb bonded interface, and a dual 54 mellanox port card20:15
mpontillofirl: yes, on the settings page you can upload a custom commissioning script. but doing so may not help since your IB interfaces won't have 6-byte MAC addresses and it's not likely that the MAAS commissioning script will do anything with them. if you're okay with MAAS not knowing about your IB devices, that would be easier (though undesirable for other reasons)20:15
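mpontillo's point about 6-byte MAC addresses can be illustrated with a small check: Ethernet hardware addresses are 6 octets long, while IPoIB link-layer addresses are 20 octets. The Python below is only a sketch with hypothetical helper names and made-up sample addresses; it is not what the MAAS commissioning scripts do.

```python
def hw_addr_octets(addr: str) -> int:
    """Count the octets in a colon-separated hardware address string."""
    parts = addr.split(":")
    # Each part must be exactly two hex digits.
    if not all(len(p) == 2 and all(c in "0123456789abcdefABCDEF" for c in p) for p in parts):
        raise ValueError(f"not a colon-separated hex address: {addr!r}")
    return len(parts)


def looks_like_ethernet(addr: str) -> bool:
    """True for 6-octet (Ethernet-style) hardware addresses."""
    return hw_addr_octets(addr) == 6


# Made-up sample values for illustration.
eth_addr = "52:54:00:12:34:56"
ib_addr = "80:00:02:08:fe:80:00:00:00:00:00:00:00:02:c9:03:00:1e:3a:b1"

print(looks_like_ethernet(eth_addr))  # True
print(looks_like_ethernet(ib_addr))   # False (20 octets)
```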
wililupyfirl: When I did this, I had the power control set to my DLI PDU to manage power up/down/status since there is no power control on mellanox switches.20:16
firlya20:16
wililupyYou also need to set PXE as default boot. and set a tag up in MAAS for the serial console so that you can manage it:20:16
firlya that’s already all configured20:17
firlidrac technically but ya20:17
wililupymaas admin tags create name=mellanox-sn2700 kernel_opts="console=tty0 console=ttyS0,115200n8"20:17
firlwililupy mpontillo: so commissioning is one thing, what about deployment? when I deploy a node how do I have a 3rd party script run to init the interfaces ?20:18
firlso that I can have juju use it20:18
mpontillofirl: I would do that with a custom curtin_userdata script (found in /etc/maas)20:20
firlok, I will digest this. It might be a few weeks before I will have time to do something that in depth20:20
mpontillofirl: sure. sorry, I meant /etc/maas/preseeds -- and you can have more specific files in there if you want, such as "curtin_userdata_amd64_generic_xenial_myhostname" which would match Xenial deployments [with a generic kernel] to AMD64 machines, where the hostname is "myhostname"20:25
mpontillofirl: you should be able to leave off the [_X] pieces if you want to write a more generic preseed YAML20:25
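The preseed naming scheme mpontillo describes can be sketched as a list of candidate file names, from most specific to most generic. This is a guess at the pattern based only on the example name above: it assumes pieces are dropped from the right-hand end, and the function name `preseed_candidates` is hypothetical. The order in which MAAS actually looks files up may differ.

```python
from typing import List


def preseed_candidates(arch: str, subarch: str, release: str, hostname: str,
                       prefix: str = "curtin_userdata") -> List[str]:
    """Build candidate preseed file names, most specific first.

    Assumption: trailing pieces are dropped one at a time to get more
    generic names, ending with the bare prefix.
    """
    parts = [arch, subarch, release, hostname]
    return ["_".join([prefix] + parts[:n]) for n in range(len(parts), -1, -1)]


for name in preseed_candidates("amd64", "generic", "xenial", "myhostname"):
    print(name)
# curtin_userdata_amd64_generic_xenial_myhostname
# curtin_userdata_amd64_generic_xenial
# curtin_userdata_amd64_generic
# curtin_userdata_amd64
# curtin_userdata
```

Such files would live under /etc/maas/preseeds, as mentioned above.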
firlnice, yeah I can do a bash script check to see if the device exists on the system20:26
firlthe hangup for juju to be able to use it originally was that it required post configuration and juju wouldn’t recognize it because it was running before the post configuration could go20:27
firlso I think this could work, not easily managed in a gui, but still work quite nicely20:27
mpontillofirl: yeah, most people just use the GUI for the initial setup and then automate everything from there AIUI20:31
firlya20:32
firlfor the people I need to hand off to, gui’s are easier to understand hah20:32
mpontillofirl: we do have a facility for custom drivers as well (drivers.yaml) but there is a bug where it doesn't work properly in MAAS 2.1 which has been fixed in MAAS 2.2, and it is only for deployment, not commissioning. it can identify hardware by PCI ID and add a custom repo with a DKMS module20:32
firloooo20:32
mpontillofirl: other issue with doing it that way is the movement away from DKMS and toward UEFI boot with signed kernels20:32
firlyeah, I have bios boot for everything because it is more supported20:33
firlchanging the bios on the 50 machines I help manage is a pain20:33
kklimondaI've been seeing this issue randomly over the last few deployments: https://launchpad.net/bugs/1673204 - is there something I can do to debug it further?21:06
kklimondawow, I just read the code that is failing21:08
