08:11:54 #startmeeting multisite
08:11:54 Meeting started Thu Jan 28 08:11:54 2016 UTC. The chair is joehuang. Information about MeetBot at http://wiki.debian.org/MeetBot.
08:11:54 Useful Commands: #action #agreed #help #info #idea #link #topic.
08:11:54 The meeting name has been set to 'multisite'
08:13:08 #topic rollcall
08:13:13 #info joehuang
08:13:23 #info Ashish
08:13:43 #info dimitri
08:13:43 connection issue
08:14:19 hello, here in China we'll have Chinese New Year soon
08:14:42 and I'll leave for vacation from Feb 4 to Feb 14
08:15:02 Happy new year in advance!!!
08:15:14 cool!
08:15:15 so I cannot chair the meeting next week or the week after
08:15:24 thanks
08:15:32 do you guys also celebrate on January 1st?
08:15:45 yes, we eat food
08:15:59 2016 is a Monkey year
08:16:00 and have family time
08:16:19 yes, it's a Monkey year. Even you know that~
08:16:28 any recommendations from the Chinese horoscope? :)
08:17:00 the system has some issue today, it often quits automatically
08:17:19 you'd be surprised, but the Chinese horoscope is big in Georgia :)
08:18:16 I don't have much knowledge about the horoscope
08:18:29 I only know which year is related to which animal
08:18:41 hi, let's get back to Kingbird
08:18:55 #topic quota implementation
08:19:03 yes
08:19:33 I see two ways of implementing it, please comment
08:19:39 There can be two approaches 1) API, JD & JW JD creates multiple JW through RPC call. So there will be 'n' RPC calls for 'n' regions. Also there has to be some way of sharing region specific data back to JD. When JW runs asynchronously then it can be tough to return/share the data to JD. 2) API and Engine(can call JD) Quota Manager will be a separate module in KB and API will call it directly for DB updates. For quota reb
08:19:42 hi Ashish, could you share your new idea on the implementation after the last meeting?
08:20:08 the text above did not go through well..
08:20:14 I am resending it
08:20:20 ok
08:20:28 1) API, JD & JW
08:20:33 JD creates multiple JWs through RPC calls, so there will be 'n' RPC calls for 'n' regions. Also, there has to be some way of sharing region-specific data back to the JD.
08:20:38 When the JWs run asynchronously, it can be tough to return/share the data with the JD.
08:21:05 agree. that's why I changed it a little in the last meeting
08:21:11 the data here is resource usage
08:21:18 so there is one more approach
08:21:18 with me
08:21:27 please
08:21:30 2) API and Engine (which can call the JD)
08:21:43 The Quota Manager will be a separate module in KB, and the API will call it directly for DB updates.
08:21:45 For quota rebalancing, an API request is made to the engine (it also runs as a periodic task without an API request); the engine then calls the Quota Manager for the rebalancing task.
08:21:49 QM uses the multiprocessing module of Python to create a pool of workers, each performing a region-specific task.
08:21:52 So QM will be the parent process and the workers will be its children.
08:21:56 All the workers start their tasks asynchronously, and whichever worker finishes its task has to share that data with the parent process (QM). For this, QM also creates an unnamed pipe so that all the workers (child processes) have access to it. As soon as a worker finishes its task, it writes its information to the pipe.
08:22:00 QM waits till all the workers finish their tasks by calling .join() on them (threading concept).
08:22:03 Also, all the workers acquire a lock on the pipe before writing/reading to ensure synchronization.
08:22:08 Then QM has each region's information; it performs the calculation, and the final limits are given back to the processes for the limit update.
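[Editor's sketch of the multiprocessing design described above: a pool of worker processes, an unnamed pipe, a lock, and .join(). collect_usage() and its return value are hypothetical placeholders, not Kingbird code.]

```python
import multiprocessing


def collect_usage(region, conn, lock):
    # Hypothetical per-region task: fetch resource usage for one region.
    usage = {'region': region, 'cores_used': 4}  # placeholder result
    with lock:            # workers lock the pipe before writing
        conn.send(usage)  # report back to the parent (QM)


def rebalance(regions):
    # QM (the parent) creates an unnamed pipe shared by all workers.
    parent_conn, child_conn = multiprocessing.Pipe()
    lock = multiprocessing.Lock()
    workers = [multiprocessing.Process(target=collect_usage,
                                       args=(r, child_conn, lock))
               for r in regions]
    for w in workers:
        w.start()
    for w in workers:
        w.join()          # QM waits until all workers finish
    usages = [parent_conn.recv() for _ in regions]
    # ... QM would now compute the new limits from `usages` and
    # push them back out to each region (omitted here).
    return usages


if __name__ == '__main__':
    print(rebalance(['RegionOne', 'RegionTwo']))
```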
08:22:14 We have to cache region-specific clients as well, so that they can be reused during reading and writing.
08:22:17 This way there will be only one RPC call from the API to the Engine; the engine creates the pool of workers with the multiprocessing module of Python, and information is shared using pipes.
08:23:53 good idea
08:24:27 what happens when a task fails?
08:24:39 using multi-process or multi-eventlet? do you have any comparison?
08:25:02 say we have only 2 successful task executions out of 4. what will QM conclude?
08:25:06 multi-eventlet is more lightweight
08:25:41 the task should be marked failed; it succeeds only if all sub-tasks succeed
08:25:56 then we can perform a retry for those failures
08:26:21 I agree with joe
08:26:44 if even one task fails, QM should try again. otherwise quota limits will be out of sync
08:26:47 yes. after the configured number of retries, let it fail. it'll simplify the design
08:27:07 you mean each sub-process should have a number of retries
08:27:31 yes. something like that
08:28:50 would Zhiyuan do a test, to see if multi-eventlet also works for multi-region API calls, and can do them concurrently without a lot of overhead?
08:29:23 programming with multi-process is not easy
08:30:11 ok, I can do a test that spawns multiple green threads where each green thread GETs some URLs
08:30:37 but we still need to share info from the threads to their caller
08:30:46 yes
08:30:54 they run in an async manner
08:31:17 whenever one finishes, it has to write its output to some shared space
08:31:22 Ashish, what do you think about this multi-processing approach compared to multi-eventlet?
08:31:50 and Dimitri?
08:31:56 #link https://github.com/svinota/pyroute2/issues/99
08:32:05 I wouldn't go multi-processing
08:32:14 too much hassle
08:32:32 okay.
08:32:53 we need to have multiple workers working in parallel
08:33:11 that you can do with threading
08:34:22 we have more control over the processes with multiprocessing. it is similar to threads
08:35:39 #link http://stackoverflow.com/questions/3044580/multiprocessing-vs-threading-python
08:35:53 what's wrong with the thread group manager?
08:36:31 well, your link just gave you the answer :)
08:36:49 I would avoid multi-processing for exactly the same reasons
08:37:28 so Dimitri also prefers eventlet over multi-processing/threading?
08:37:29 difficult to sync, difficult to share info, expensive to spawn
08:37:46 hmm.. separate memory space
08:39:03 joe, yes, I'm more inclined to use something more lightweight
08:39:03 here we need shared memory for the data coming from different regions
08:39:42 and for that I would have a look at how they implemented it in Senlin
08:39:49 with the concept of Actions
08:40:05 for eventlet, we need some tests. So I suggest Ashish works on the overall code framework, and zhiyuan does a test to see if eventlet works
08:40:39 also have a look at Senlin's actions
08:40:46 yes
08:40:49 their actions operate on clusters
08:40:54 I had introduced multiprocessing as it's easy to share a pipe among parent/child processes
08:41:08 okay.. will look into Senlin
08:41:25 no need to go multiprocess
08:41:32 hi, has Ashish already implemented multi-processing?
08:42:00 I have not seen a patch yet
08:42:08 no. I have not done it
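[Editor's sketch of the eventlet test zhiyuan proposes at 08:30:11: spawn one green thread per URL, GET it, and collect the results in the caller. The URLs are placeholders; GreenPool.imap() is one simple way to share the threads' output with the caller.]

```python
import eventlet
eventlet.monkey_patch()  # patch socket etc. so the GETs yield cooperatively

from urllib.request import urlopen

URLS = ['http://example.com', 'http://example.org']  # placeholder URLs


def fetch(url):
    # Each green thread GETs one URL and returns (url, HTTP status).
    resp = urlopen(url)
    return url, resp.status


pool = eventlet.GreenPool(size=10)
# imap() runs fetch() concurrently and yields each result back to the
# caller, so no explicit shared space or lock is needed.
for url, status in pool.imap(fetch, URLS):
    print(url, status)
```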
08:42:18 ok
08:42:19 we are yet to finalize the design
08:42:43 I have done a small hello-world program at my end, just to get hands-on experience
08:42:53 #action Ashish to check Senlin's implementation of Actions
08:43:02 #action zhiyuan to do a test to see if eventlet works
08:44:24 And zhiyuan finished a two-region installation with devstack in Tricircle, but one bug is left; after the bug is fixed, zhiyuan can submit a patch in Kingbird too for the two-region setup
08:44:44 sounds good
08:45:35 currently the multi-region setup in devstack does not work well, even in the openstack community
08:46:22 is there a specific reason?
08:47:03 they changed the devstack script, but no one has yet installed multi-region via devstack
08:47:12 so the bug was not found
08:48:01 I didn't know that somebody is working on mr in the devstack scripts
08:48:39 it will be very useful.
08:48:42 I am afraid that Senlin only deals with VM clusters in one OpenStack
08:49:06 yes
08:49:41 I think their concept can be extrapolated onto multiple regions
08:50:08 basically they use ThreadGroupManager
08:50:32 there's a dispatcher that manages actions
08:50:35 worth digging into
08:51:53 then we have to share the information
08:52:23 yes, please share on the m-l. I'll check mail and can do reviews from my phone after Feb 4
08:53:38 and one more thing, API(QM) rpc JW, just bypass the now
08:53:43 share the information between the called thread and the calling process
08:54:28 with multi-eventlet it's easier: one shared queue
08:54:49 Ashish, check ThreadGroupManager and eventlet
08:55:02 OR with threads we can use named pipes
08:55:31 #action Ashish check ThreadGroupManager and eventlets
08:56:50 https://github.com/openstack/senlin/blob/master/senlin/engine/scheduler.py
08:57:30 so we agree that the QM sends one RPC out
08:57:32 as far as I know, the creator of a gthread can use a GreenPile to retrieve the return value of each gthread
08:57:48 one way to collect the result of each worker
08:58:06 not QM, the KB API
08:58:50 zhiyuan_: the above ThreadGroupManager uses threadgroup from oslo
08:58:58 which in turn uses eventlet
08:59:08 good news
08:59:30 oh, so another convenient oslo lib
08:59:38 I would really, really try that track :)
08:59:39 yes.. ThreadGroupManager is based on eventlet
08:59:44 yes
08:59:47 oslo_service
09:00:26 guys
09:00:28 we need to close
09:00:38 #info https://github.com/openstack/senlin/blob/master/senlin/engine/scheduler.py
09:00:56 ok, we need more frequent email discussion
09:00:58 #link https://github.com/openstack/senlin/blob/master/senlin/engine/scheduler.py
09:01:10 let's continue this on the ml
09:01:16 let's conclude the meeting today
09:01:26 sure
09:01:28 yes, m-l discussion is needed
09:01:41 and share information
09:01:41 ok
09:01:43 thanks guys
09:01:48 see you
09:01:51 bye
09:01:54 #endmeeting
09:01:55 bye
09:01:56 thanks guys. Good bye
09:02:22 #endmeeting
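[Editor's sketch of the GreenPile idea mentioned at 08:57:32: the spawner retrieves each green thread's return value by iterating the pile, with no explicit shared queue. get_region_usage() and its result are hypothetical placeholders.]

```python
import eventlet


def get_region_usage(region):
    # Stand-in for a blocking per-region API call.
    eventlet.sleep(0)
    return {'region': region, 'cores_used': 4}  # placeholder result


pile = eventlet.GreenPile()
for region in ('RegionOne', 'RegionTwo'):
    pile.spawn(get_region_usage, region)

# Iterating the pile yields each gthread's return value, in spawn order.
for usage in pile:
    print(usage)
```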