20:00:07 #startmeeting ovsdb_weekly_call 20:00:07 Meeting started Tue Feb 10 20:00:07 2015 UTC. The chair is tbachman. Information about MeetBot at http://ci.openstack.org/meetbot.html. 20:00:07 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 20:00:07 The meeting name has been set to 'ovsdb_weekly_call' 20:00:13 #chair shague 20:00:13 Current chairs: shague tbachman 20:00:16 #chair flaviof 20:00:16 Current chairs: flaviof shague tbachman 20:00:23 #topic agenda 20:00:42 #link https://meetings.opendaylight.org/opendaylight-ovsdb/2015/ovsdb_weekly_call/opendaylight-ovsdb-ovsdb_weekly_call.2015-01-27-20.00.html Last recorded meeting minutes 20:02:57 48 members — getting up there :) 20:03:21 #topic Trello Board 20:03:59 #link https://trello.com/b/ddIvDQE0/ovs-openstack Trello Board for the OVSDB project 20:05:48 #info shague says the NSF migration is waiting on go-ahead from michal_rehak 20:06:21 #info vishnoianil and shague have pushed patches to remove AD-SAL pieces 20:06:42 #info shague and vishnoianil working on replacing AD-SAL Node with MD-SAL Node 20:06:54 #info shague eventually replace activation with the config subsystem 20:07:05 #info edwarnicke offers his help with the config subsystem 20:07:39 #info The Neutron Services is eligible for a creation review at this Thursday’s TSC meeting 20:07:52 Flavio Fernandes proposed a change to ovsdb: Fix ovsdb-ovs-full-integration-daily https://git.opendaylight.org/gerrit/15119 20:07:53 #info colindixon says it’s hard for him to imagine the Creation Review not going through 20:08:15 flaviof: mixing up commits in our meeting minutes :x 20:08:16 lol 20:08:48 debugging a network live via webex 20:08:51 lol 20:09:07 #info flaviof says a lot of progress has been made on networking-odl plugin 20:09:29 #info the openstack folks want to remove everything that’s not pure openstack and turn it into a plugin 20:09:52 #info flaviof says there’s still work to be done to connect things like LBaaS, FW, and L3 20:10:04 #info flaviof says the critical piece is the integration with gerrit and tempest testing 20:10:53 #info Swami asks what the services work is — pulling from a repo? 20:11:07 #info flaviof says the first one is a final gerrit to remove all the ODL stuff from the devstack repo 20:11:56 #info flaviof says plaurin_ is working on the “entry points” in the config, and is looking for help from Swami and mestery, to take the out of tree work and plug it in (i.e. not use the L3 agent way, but now pass this responsibility to ODL) 20:12:20 #info shague says that stuff is important for supporting the back-end tempest testing so that OpenStack and ODL can integrate 20:12:45 #info LuisGomez asks shague if they’re trying to get this testing going in the LF infrastructure 20:12:48 #info flaviof says yes 20:13:15 #info LuisGomez asks if they’ve gotten any input from the LF infrastructure folks — he’s been looking for the same things to support the needs of the integration team 20:13:43 #info flaviof says the task right now is to make sure that openstack doesn’t break ODL — i.e. a changing openstack vs. a stable ODL release 20:14:03 #info flaviof says that LuisGomez is probably looking for the opposite — ODL changes affecting external projects 20:14:07 #info LuisGomez says that’s the case 20:14:29 #info LuisGomez asks if flaviof has had any issue with Vagrant, etc.? 20:16:12 #info flaviof says there’s a jenkins job in OVSDB (not yet in JJB) that spawns a special VM and does an all-in-one openstack deployment 20:16:33 #info flaviof says that tempest is one of the things they’d do there 20:16:42 #info LuisGomez asks if Vagrant is used in this setup 20:16:58 #info flaviof says no, not in this one, and has heard from LF folks that doing things in Vagrant is a nightmare 20:17:45 #info shague says the goal is a multi-node deployment 20:19:33 #info edwarnicke has submitted a bunch of new patches for the MD-SAL OVSDB southbound 20:19:56 edwarnicke: https://git.opendaylight.org/gerrit/14877 20:19:59 #link https://git.opendaylight.org/gerrit/#/q/status:open+project:ovsdb+branch:master+topic:mdsal_ovsdb_sba <- ovsdb mdsal sb patches 20:20:08 edwarnicke: thx :) 20:20:19 #info You can do both active and passive mode connecting 20:20:30 #info snackewm provided a patch so that the passive-node delete works 20:20:48 #info There are patches that bring in the infa for the monitor requests 20:21:08 #info There’s also a patch that listens in the config store for the OVDSB bridge nodes and retrieves the client 20:21:57 #info edwarnicke says we need to get some of the data in the operational data store before pushing things down; would be nice to be able to look up th bridge to determine handles for talking to different elements 20:22:09 #info edwarnicke says it’s the translation between the model and the row/column table objects 20:22:20 #info vishnoianil says taht will be done by the 12th 20:23:10 #info shague asks if there’s any UT for this 20:23:20 #info edwarnicke says he’ll feel much better as more tests are put in place 20:24:01 #info edwarnicke asks about the docker-based testing 20:24:45 #info flaviof says the docker stuff was useful for version testing against different revs of OVS 20:24:51 #info edwarnicke asks if there’s a version in the schema 20:24:57 #info shague says that’s a field in the tables 20:25:47 #info edwarnicke asks for some examples in the ways that schema shifts between versions 20:26:05 #action flaviof to come up with concrete examples of schema shifts 20:26:57 #info shague says the nicira extensions for tunnels are one example; used to be part of the port, now are flow-based 20:27:51 #topic clustering 20:28:36 #info ed asks colin to be nice 20:28:38 #info colindixon says that clustering comes down to 3 things that are hard in the current model 20:28:41 flaviof: lol! 20:28:41 #undo 20:28:41 Removing item from minutes: 20:28:53 flaviof: you undid me :) 20:28:53 #info notes Colin is usually nice, if perhaps sardonic ;) 20:29:02 lol 20:29:04 lol 20:29:40 #info colindixon says that clustering comes down to 3 things that are hard in the current model 20:29:43 #info colindixon says you need to figure out what the state is, then you have to model it, and then you have to make your data change notification listeners to work correctly 20:29:49 flaviof: thx ;) 20:29:59 sorry tbachman u r too fast 20:30:01 flaviof: now you stole the minute credit ;) 20:30:02 lol 20:30:06 lol 20:30:14 * tbachman is just having a go with flaviof 20:31:38 #info shague says with the OVSDB use case, it pulls in the DB, and when this goes to clustering, how does this map in ODL 20:32:21 #info colindixon points out that clustering does not address failover — just recovery of state 20:33:43 tbachman: you're slwoing down 20:33:48 lol 20:33:59 cdub: colindixon burnt out my fingers 20:34:00 nothing but nubs 20:34:22 ;) 20:34:24 cdub: we have a recording :) 20:34:51 #info For deep discussion on clustering and failover behaviors, please see the webex recording at about 30 minutes in 20:34:54 cdub: ^^^^ 20:35:02 lol 20:35:18 * tbachman exercises creative laziness 20:35:22 tbachman: well done 20:35:28 lol 20:36:01 #info vishnoianil says all the connection caching is at the library level 20:36:38 #info vishnoianil says in a clustered configuration, it’s okay if a node goes down; but what happens if a heartbeat is lost and a new leader is elected (i.e. connection is stil there, even though the cluster leader went down) 20:36:59 that's a core part of clustering 20:37:02 #info colindixon says that has to be handled somehow out of band 20:37:32 it's split brain, and you need to fence in some way 20:37:34 #info colindixon says it is possible for a 3-node cluster for one node to be operating in the past, yet still have connections to local things 20:37:51 #info cdub says this is a split brain scenario, and it needs to be fenced in some way 20:38:22 #info colindixon says he doesn’t know if OVSDB provides any such mechanism 20:38:42 #info vishnoianil says there are two connections involved — OpenFlow plugin and OVSDB 20:39:07 #info colindixon says one way to deal with this is to piggy-back on openflow (whoever OpenFlow controller is, have OVSDB follow) 20:39:17 cdub: white picket fence? 20:39:22 tbachman: heh 20:39:31 #info colindixon asks if OVSDB has any way to deal with this 20:39:40 #info shague says you can have multpiple managers 20:39:51 #info colindixon asks if you can ask an OVSDB manager who it’s managers are 20:39:57 #info shague says there is a table for this 20:40:11 #info colindixon says that this can be used — where the manager writes themselves into the table 20:40:19 #info shague says you can also be a listener 20:40:25 #info colindixon asks if this state is kept in the table 20:40:35 #info shague says he doesn’t believe so 20:41:05 #info colindixon says that if someone’s going to take over node management, you would have at most 1 instance of the OSVDB plugin attached to the switch at any given time 20:41:43 #info you can have all the nodes in the cluster connected, but only one node desginating themselves as the master somehow 20:43:10 white picket fence == stonith 20:43:20 lol 20:43:30 #info edwarnicke asks about the use of external_ids in the OVSDB tables 20:43:42 #info edwarnicke asks if we can write an external_id for whatever we consider to be the master 20:44:11 there's also other_config 20:44:16 #info shague says external_ids are used so the node can identify itself to the management piece 20:44:22 #info cdub says the other_config can be used 20:45:06 #info edwarnicke says it feels like doing this allows us to determine the rightful owner, so that stonith can be performed 20:45:18 * tbachman prefers stonith to “shot in the head" 20:45:19 :o 20:46:06 #info edwarnicke says one way to discover the master is gone by listening to updates on the managers table 20:46:30 #info colindixon says that only works if every cluster instance is connected to every node — otherwise when the master goes down, you stop getting updates 20:46:52 great lesson in managing distributed state :) 20:47:59 #info the advantage of the complete connectivity model is that failures in the cluster, it’s all internal 20:48:34 * tbachman has to duck out for a sec…. 20:51:03 * tbachman jumps back in 20:51:30 #info colindixon says that load balancers are sometimes deployed between teh devices connecting to the controller and the controller 20:51:47 #info edwarnicke says load balancers would have to understand all the way down to the guts of the OVSDB protocol 20:52:16 #info colindixon says first thing first: worry about internal cluster state; address failover second 20:52:40 #info shague says the main request was to get the clustering support in; sounds like this addresses the needs of the users 20:53:03 #info edwarnicke agrees, but wants to make sure that controller node local and internal state is the only data cached (e.g. in hash-maps) 20:53:37 #info shague says the schema is down in the lib; if the node goes away, the schema goes away 20:54:01 #info shague says in order to drive OVSDB, you need that schema; on a reconnect, you get the schema back 20:54:32 we've had this theory going for a while now, sounds like we should make test case around this 20:54:39 #info colindixon says do not store any state in the cluster that you can easily rebuild 20:54:43 cdub: +1k ;) 20:55:06 #info colindixon says it’s a lot easier to handle the lazy loading of caches 20:57:35 #info flaviof says there are a bunch of gerrits that edwarnicke needs merging — asks if shague or vishnoianil can look at these 20:57:45 #action shague and vishnoianil to look at edwarnicke’s gerrits 20:58:33 #info colindixon says his suggestion is to use the gerrit comments where possible, moving to live discussion if needed 21:02:27 * tbachman stops scribing during discussion on TDD vs. other methods 21:02:50 tbachman: that’s right 21:03:04 uh-oh, somebody pushed the wrong button 21:03:07 lol 21:03:49 tbachman, awesome minutes 21:03:58 tbachman, Thank you! 21:03:59 dneary: glad to help 21:04:04 np! :) 21:04:11 #endmeeting