#opnfv-cperf log

18:19:00 <dfarrell07> #startmeeting cperf
18:19:00 <collabot> Meeting started Thu Jul 21 18:19:00 2016 UTC.  The chair is dfarrell07. Information about MeetBot at http://wiki.debian.org/MeetBot.
18:19:00 <collabot> Useful Commands: #action #agreed #help #info #idea #link #topic.
18:19:00 <collabot> The meeting name has been set to 'cperf'
18:20:50 <dfarrell07> #info Hello World CBench job has been merged, help from Tim of Apex, not currently triggered but jamoluhrsen is working on it
18:23:39 <jamoluhrsen> #link https://git.opendaylight.org/gerrit/#/c/37639/ <--- patch with maybe relevant ovsdb scale test stuff
18:23:49 <dfarrell07> #info Nikos has been stuck doing internal stuff last two weeks, has been doing some prelem testing with multinet running in Docker containers, have been able to get mininet topos running with kernel-based OVS
18:24:03 <dfarrell07> #info Required tuning, like system file limits, to get mininet running
18:24:22 <dfarrell07> #info were able to get switch scale tests running with about 3k switches, 16 docker containers running multinet
18:24:39 <dfarrell07> #info up to 1.5k switches, perf seems to be same as VM test
18:25:07 <dfarrell07> #info may be problem with too few resources for containers, not an issue in kernel world, still digging
18:25:43 <dfarrell07> #info using VMs is an alternative of course, but doing container tests to figure out for sure if there's a scale issue in the kernel, to know if we really need VMs for kernel isolation
18:26:31 <dfarrell07> #info also working on NB traffic generation logic, to make traffic stateful, to track on which switches have flows installed, enable more complex scenarios like ones Al is talking about with topo failures
18:27:12 <dfarrell07> #info question about if ODL enables NB app to get notifications about if network changes, like if change in data store, LuisGomez says it should be possible using websockets
18:27:42 <dfarrell07> #info flows get updated through stat reports, so don't get exact time flows are installed or as up-to-date notifications
18:28:25 <dfarrell07> #info LuisGomez asks question to NIkos about when pushing tons of flows into OVS, it takes a bit to get stats/flow counts from OVS, how do they do that/are they seeing this delay
18:28:58 <dfarrell07> #info LuisGomez is trying to figure out if our test script is slow to collect this data or if OVS is just slow at reporting these stats
18:31:48 <dfarrell07> #info Detailed discussion about metrics collection and timing, tech details of implementation
18:32:52 <dfarrell07> #info Nikos doesn't think he's been collecting this number, LuisGomez asks him to check if possible
18:33:23 <dfarrell07> #info Programming 100k flows is super fast with bulk-o-matic, like 5s, but it also takes like 5s to get stats, so makes it difficult to measure metric
18:33:54 <dfarrell07> #info LuisGomez is interested in better ways to collect this data, if Nikos has such a way
18:34:21 <dfarrell07> #info LuisGomez asked this of Verex test people last week as well, no response so far
18:35:34 <dfarrell07> #info LuisGomez is looking at tests we want to run in next ODL whitepaper
18:35:51 <dfarrell07> #info One good one is REST NB flow programming
18:36:04 <dfarrell07> #info another is measuring flow programming on SB, LuisGomez needs to look into tooling for that
18:36:30 <dfarrell07> #info Nikos is measuring this type of thing with NSTAT already
18:37:23 <dfarrell07> #info dfarrell07 asks about Peter and Jan joining whitepaper efforts, Vratko says that Peter may be joining
18:38:48 <dfarrell07> #info Plan is to build tests on ODL infra, then copy robot logic to Intel lab to get numbers on HW
18:41:19 <dfarrell07> #info Cluster testing will not be a different test, there will be a test, and it will be done with single and cluster (if applies)
18:42:32 <dfarrell07> #info Current test are fairly black box, so should be about the same for 1 node or 3 or whatever
18:42:40 <dfarrell07> #info may run into differences if sending flows to leader or follower or whatever
18:43:08 <dfarrell07> #info dfarrell07 would like to bring cluster devs into this conversation, to get their feedback about tests
18:45:42 <dfarrell07> #info Discussion about data store vs RPC drop tests, we're not doing them in ODL because they are unstable, Nikos is doing DS test but is seeing lower numbers
18:14:38 <collabot> dfarrell07: Error: Can't start another meeting, one is in progress.  Use #endmeeting first.
18:14:42 <dfarrell07> #endmeeting