15:02:36 #startmeeting neutron_northbound 15:02:36 Meeting started Wed Apr 20 15:02:36 2016 UTC. The chair is yamahata. Information about MeetBot at http://ci.openstack.org/meetbot.html. 15:02:36 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:02:36 The meeting name has been set to 'neutron_northbound' 15:02:42 #topic agenda bashing and roll call 15:02:47 #info yamahata 15:03:00 #link https://wiki.opendaylight.org/view/NeutronNorthbound:Meetings agenda page 15:03:15 today we don't have update on meeting page. 15:03:25 Do we have any topics for today? 15:03:33 It's less than 1 week for openstack summit 15:03:53 #chair asomya mkolesni oshvartz 15:03:53 Current chairs: asomya mkolesni oshvartz yamahata 15:04:02 just reviews on my side 15:04:15 oshvartz, Same for me 15:04:49 okay, move on 15:04:55 #topic Announcements 15:05:15 next week openstack summit will be held. 15:05:27 So this meeting will be skipped next week. 15:05:43 For neutron developer 15:05:49 #link https://www.openstack.org/summit/austin-2016/summit-schedule/global-search?t=Neutron%3A 15:06:00 #link https://wiki.openstack.org/wiki/Design_Summit/Newton/Etherpads#Neutron 15:06:47 And I'm proposing to gather on Monday and Tuesday around the event registration. 15:06:58 during keynot speach 15:07:54 that's all from me. any announcement? 15:08:43 seems nothing else 15:08:51 #topic action items from last meeting 15:08:59 mostly for yamahata 15:09:29 I've proposed odl gathering offline. 15:09:53 yamahata, surprised to know the meeting is on this channel. 15:09:58 I haven't contacted Kyle for netorking-odl release because there is ci issue now. 15:10:03 gongysh: hello. 15:10:14 It's for historical reason. 15:10:34 gongysh: can you please update meeting calendar for openstack? 15:11:06 yamahata, do you mean my own calendar or somewhere? 15:11:10 Once those issues are fixed, I'll contact Kyle. Or meet him at openstack summit 15:11:30 gongysh: I mean https://wiki.openstack.org/wiki/Meetings 15:11:41 we need to send a patch. 15:11:56 http://eavesdrop.openstack.org/ 15:12:27 gongysh: can we discuss this issue at open mike? 15:12:36 yamahata, sure 15:12:47 okay let's move on bugs/patch 15:12:51 #topic patches/bugs 15:13:09 For now openstack ci doesn't run properly. 15:13:20 also intel ci seems to fail a lot 15:13:43 With https://review.openstack.org/#/c/307933/ "move beryllium snapshot to 4.2 from 4.1" beryllium snapshot and boron snapshot runs 15:14:08 But lithium-snapshot CI doesn't run tempest properly. 15:14:45 Please notice its runtime is 20-30min. It setup devstack but fails to run tempest. 15:15:31 I'd like to raise ODL patch issue. 15:15:36 #link https://trello.com/c/r2qbjSHj/86-enable-maven-site-generation 15:15:59 as ODL TSC decided, ODL neutron northbound should follow it. 15:16:29 That's it from me. 15:16:34 Any other patches/bugs to discuss? 15:16:39 yes 15:16:49 #link https://review.openstack.org/#/c/305132/ 15:17:14 Add journal maintenance thread 15:17:26 i don't have much to add there for now 15:17:49 i think we can proceed with it as is to have the basic maintenance thread functionality there 15:17:56 so i'll be happy for some reviews 15:18:18 I tested it and also added a patch with cleanup operation 15:18:49 Okay. How do we want to coordinate with https://review.openstack.org/#/c/304168/ 15:19:03 #link https://review.openstack.org/#/c/304168/ Journal entries can get stuck forever causing busy wait 15:19:08 yamahata, do we have any BP spec for journal feature? 15:19:31 it seems this project does not follow normal development process. 15:19:33 gongysh: no BP. but we have a page. https://wiki.opendaylight.org/view/NeutronNorthbound:NeutronDriverOverhaul 15:20:05 I think we can merge 304168 (after review) and then we can convert it into maintenance thread. 15:20:13 as I wrote initially I think that 304168 should rebase on 305132 and use the maintenance thread infra 15:20:20 yamahata, then we at least, put this URL on review patch. 15:20:24 then 304168 would be much simpler 15:20:41 asomya: what do you think? 15:21:02 yamahata, asomya had to step away for a few minutes 15:21:12 I can say that it was very easy to add the cleanup operation from "COMPLETED" rows 15:21:14 perhaps you can come back at end of meeting 15:21:51 gongysh: hmm then it would make sense to create blueprint in launchpad. and add links. 15:22:42 gongysh: Right now networking-odl project doesn't allow to create blueprint. We need to contact Kyle. 15:23:31 rcurran: oh. 15:24:45 Now asomya is away. can we continue the discussion at 304168 ? 15:25:19 yamahata, i'm suggesting coming back to that question (which patch should go in first) when asomya returns 15:25:32 rcurran: makes sense. 15:25:43 do we have any other patches to discuss? 15:26:16 #link https://review.openstack.org/#/c/305321/ 15:26:34 Fix delay in sync pending rows 15:27:00 doesn't wait(timeout) work? 15:27:13 I'm a bit surprised. 15:27:24 No there is an issue with the API workers fork 15:28:13 Ah. So initialization code related API workers needs to be improved, right? 15:28:35 Probably we need something like init before/after API worker fork 15:28:42 don't we? 15:28:51 theres no support for it in ML2 15:28:57 from what we saw 15:29:05 ML2 calls initialize on the driver 15:29:11 Okay, that's good topic to discuss at openstack summit ML2 gathering. 15:29:14 Yes, but I'm not familiar with that code 15:29:17 this is before the process fork 15:29:42 we could add an "initialize_child_process" 15:29:47 on the whole chain 15:29:51 mkolesni, correct - initialize() method is only called once per driver 15:29:53 can you please put it https://etherpad.openstack.org/p/newton-neutron-unplugged-track 15:30:10 sure but i'm not at the summit so can't discuss it there 15:30:31 i thought at one point someone was looking into some post neutron-server folks ... long time ago 15:30:34 I'll raise it to ML2 folks, Rob and Shukdev 15:30:57 forks 15:31:09 I think that for now we can stay with the threading.Timer and once we will have a fix for it we can move to wait(timeout) 15:31:10 added it to the etherpad 15:31:30 That makes sense. 15:31:31 rcurran: there were some general neutron issues with that 15:31:42 Can you please add such comment in the commit log? 15:31:44 but they were fixed, nothing for the drivers tho 15:31:57 mkolesni, oh yeah, lots and lots once people started using api/rpc_workers 15:32:18 mkolesni: yeah i was the one investigating it on RH side :) 15:32:23 rcurran: ^^ 15:32:31 * mkolesni should stop talking to self.. 15:32:50 :) 15:33:21 a few md drivers had issues w/ multi-neutron processes 15:33:29 OK, will add a comment on the commit log 15:33:59 #link https://review.openstack.org/#/c/306818/ 15:34:05 Fix race between event write and thread processing 15:35:24 this one is rather simple 15:35:27 For me, there is not moot. 15:35:40 (except coding style etc) 15:36:10 I'd like to wait for somya and rcurran to review. 15:36:10 I fixed the comment on the order of the methods 15:36:38 sorry for delays in reviews ... other work items on plate 15:36:54 #link https://review.openstack.org/#/c/307218 ODL v2: Full sync resources 15:37:14 i think this one is quite ready for now 15:37:25 so i'll be glad to get some reviews 15:37:42 as you can see it's following the design we discussed on the ML 15:37:44 I added then to this patch 15:38:10 Cool. will review it. 15:38:39 i didn't put 2 canary networks yet since i didn't get a feeling we had some sense of what to do there 15:39:04 i.e. on the ML i think this point was kind of left in the air and there wasn't a decision 15:39:20 but this can be added quite easily 15:39:39 also i didnt have time to add the support for "create or update" functionality 15:39:45 okay. it's very details. I think it can be fixed later. 15:39:52 but i think its best to get reviews first 15:39:53 As long as state machine is clean. 15:39:59 exactly 15:40:13 we can always add corner case handling later 15:40:29 mkolesni, this requires https://review.openstack.org/#/c/305132/ to be merged first, correct 15:41:07 last patch from my side #link https://review.openstack.org/#/c/307171/ 15:41:14 Add cleanup operation to maintenance thread 15:41:14 rcurran: yes i split it to allow other usage of the maintenance mechanism 15:41:37 rcurran: also makes it easier to review each independently.. 15:41:38 Also based on 305132 15:42:10 mkolesni, got it and agreed, but need you and asomya to determine which patch goes in first 15:42:28 fyi, he's still not back 15:42:56 this is a maintenance operation for deleting "COMPLETED" rows 15:43:58 regarding to 307171, I though asomy and rcurran intentionally left COMPLETED rows. 15:44:09 oshvartz, does it delete 'create|update' rows even if a delete (with same UUID) isn't in journal? 15:44:10 rcurran: asomya ? 15:44:28 yamahata: it is a problem since the journal will grow indefinitely 15:44:39 making db access slower 15:44:41 instead we can introduce cleanup script which can be run manually. 15:44:49 also mass operations can flood the table 15:45:01 mkolesni, yes understood, we knew this was needed. 15:45:03 mkolesni: yes. Do you observing slowdown? 15:45:37 i don't see why you'd want that but you can have 0 mean the cleanup is off 15:45:44 or -1 15:45:54 0 would probably mean cleanup is immediate 15:46:17 yamahata: imagine on a large cloud with 10000's or resources this table will get filled fast 15:46:28 yamahata: each port status change will trigger a row 15:46:54 yamahata: since the main purpose is for active operation this shouldn't be a DWH table 15:47:08 if cloud admin wants he can set up ETL for that 15:47:53 I'm not against the patch. Instead I'd like to understand the original intention. 15:49:00 we may want to keep some history of rows for debugging. 15:49:18 ok then as i suggest you can play with this value 15:49:37 and even have it that negative value means journal is never cleaned 15:49:40 the value was added to odl config 15:49:42 Probably some tool to dump rows can be introduced later. 15:50:13 i theory a malicious tenant can bring down a cloud like this 15:50:30 Please note that I'm not objecting the patch itself. 15:50:32 by simly issuing an endless stream of updates on a port 15:50:45 ok then we need to think of stability first 15:50:59 i think logs should be the place for debugging 15:51:00 I'm wondering what is the good way to handle complet entries. 15:51:39 at the worst you can change the config value to whatever suits you 15:51:46 mkolesni: logs makes sense. 15:51:48 including turn off the cleanup if you want 15:52:06 mkolesni, yamahata - i think we've always known that we need to eventually remove rows that are no longer needed. but needed to baby step to get initial code working 15:52:34 rcurran: in that view this patch makes sense then 15:52:38 my first thought would be to only remove those journal rows that have a completed deltete 15:52:49 delete 15:53:09 what do you mean "a completed delete"? 15:53:28 That the object was deleted ? 15:53:31 when a delete+object+uuid is complete then remove all rows w/ that UUID 15:54:00 since then we know the controller has deleted the resource 15:54:44 You don't think that this logic is unnecessary ? 15:54:50 again youll have very quickly a full table 15:54:54 with slow access 15:55:13 lets KISS 15:55:25 and delete the rows that are completed 15:55:52 if you're a brave/stupid cloud operator that wants no delete then turn it off 15:56:02 sorry, i haven't looked at the patch. do you remove journal rows that are create|updates (but no deletes)? 15:56:27 based on the entries being there for a (long) time period? 15:56:31 its removing all rows that are in state = completed 15:56:44 that have been there more than the configure time 15:56:58 mkolesni, but w/ no 'delete complete' rows for that UUID 15:56:59 the time is configured in the ini file 15:57:45 if you mean 'delete complete' that the resource is deleted from the system, then no this is not checked 15:58:04 and i dont believe theres a reason to do this 15:58:25 I agree with mkolesni that for debugging we have logs 15:58:39 for a running system this table should be either very small size, or empty 15:58:51 otherwise we will hit performance and/or space issues 15:59:04 mkolesni, ok, that's my point. i'm thinking about replay the journal 15:59:18 rcurran: interesting. 15:59:21 rcurran: for what purpose? 15:59:34 It can be used for testing/debug 15:59:44 rcurran: for debug just set the time to 999999999 15:59:58 or -1 to turn it off, if we decide this makes sense 16:00:09 sorry i have to go 16:00:24 okay, let's continue as review. 16:00:26 im guessing we will talk in 2 weeks 16:00:34 enjoy the summit! 16:00:37 lastly I'd like to raise https://review.openstack.org/#/c/295368/ 16:00:42 I think in production you won't want the table to be very bug 16:00:44 bug 16:00:45 I'd like to include the patch for mitaka release-odl. 16:00:46 big 16:01:13 #link https://review.openstack.org/#/c/295368/ 16:01:17 any other issue? 16:01:23 #topic open mike 16:02:32 okay thank you every one 16:02:38 I saw that we don't have a tag for a long time 16:02:58 oshvartz: what do you mean by a tag? 16:03:25 I mean for liberty i didn't see a tag in the git repo 16:03:42 Got it. git tag. 16:03:47 yes 16:04:47 When are we planning to do one for Mitaka 16:04:52 I see. It's a problem. 16:05:07 Soon. Once we ask Kyle, we can have a tag/branch. 16:05:21 For future, we should have the controle. I'll talk with Kyle. 16:05:23 I think we need both 16:05:57 branch for mitaka and also a tag for the release 16:06:50 I agree. 16:07:14 The current governance has a bit problem. There is no active coredev who has maintenance rights. 16:07:30 OK 16:07:54 Let's resolve it. I'll send a mail with Cc to you and others. 16:08:05 great 16:08:20 #action yamahata send a mail to Kyle on governance. 16:08:53 any other issue? we run over several minutes. 16:09:33 okay, thank you. 16:09:42 see you at openstack summit 16:09:46 #topic cookies 16:09:52 #endmeeting