============================== #opendaylight-integration: s3p ============================== Meeting started by dfarrell07 at 15:31:12 UTC. The full logs are available at http://meetings.opendaylight.org/opendaylight-integration/2016/s3p/opendaylight-integration-s3p.2016-07-29-15.31.log.html . Meeting summary --------------- * Scale/perf weekly meeting, maybe a bit of clustering (dfarrell07, 15:31:40) * Marcus is on honeymoon, enjoy! :) (dfarrell07, 15:32:29) * Jamo is on PTO, enjoy! :) (dfarrell07, 15:32:33) * Nikos is on PTO, enjoy! :) (dfarrell07, 15:32:40) * dfarrell07 has been taking detailed look at NetVirt CSIT, Docker.py CSIT lib (dfarrell07, 15:32:53) * Docker.py lib seems to have much of the logic we need (dfarrell07, 15:33:00) * Looks like we can use Dave's socketplane/docker-ovs Docker image (dfarrell07, 15:33:06) * LINK: https://github.com/socketplane/docker-ovs Best OVS Docker image (to scale in NetVirt test) dfarrell07 has found so far (dfarrell07, 15:33:11) * Dave's OVS Docker image has OVS up to 2.4. Does that work? Need newer? (dfarrell07, 15:33:18) * Docker.py lib doesn't seem to be used in any test, dfarrell07 to work on that next (dfarrell07, 15:33:24) * Luis completed a bunch of clustering lib work this week, OpenFlow next (dfarrell07, 15:33:37) * Al Morton of OPNFV CPerf/IETF (and others?) gave good feedback to Luis about OF tests/metrics, Luis to respond and continue discussion (dfarrell07, 15:33:49) * NetVirt CSIT has a lot of jobs and a lot of failures, dfarrell07 is still trying to figure out which to build from, will talk to Jamo about this when he returns from PTO (dfarrell07, 15:33:58) * GBP also has Docker libs but they are very minimal, not reusable, not what we want (dfarrell07, 15:34:03) * LuisGomez is still looking for people to help with OpenFlow tests (dfarrell07, 15:34:16) * Kumar Rishabh (on call) will be helping Luis with OF tests. Thanks! (dfarrell07, 15:34:53) * Current way of plotting perf results is result/run, which doesn't work well if doing a single run but changing a var like switch scale (vs time for example) (dfarrell07, 15:35:45) * LuisGomez will be on PTO week of 8/8 (dfarrell07, 15:36:11) * Eric + Dileep: Dileep has been doing some profiling work, interested in adding that to monitoring work Kumar is working on (dfarrell07, 15:37:55) * LuisGomez has recent additions to CSIT libs that do JVM monitoring, easy via Robot and ODL logs server (dfarrell07, 15:38:26) * LuisGomez tried this monitoring framework with a CBench perf test very recently, didn't see a perf degradation, LuisGomez thought it would be heavy but it doesn't seem to be (dfarrell07, 15:39:08) * LuisGomez will send out link to wiki with samples/image of monitoring tool (dfarrell07, 15:39:28) * Dileep and Marcus have been talking about profiling and integrating that with tests (dfarrell07, 15:40:10) * Dileep talks about it not being possible to combine data from multiple JVMs, like when running in a cluster (dfarrell07, 15:41:23) * Dileep is looking at using yourkit, collecting and automating, but kinda a mess (dfarrell07, 15:41:50) * Found another tool (missed name) that is built to run in multiple JVMs (dfarrell07, 15:42:05) * This tool can combine JVM outputs to find non-intuitive issues (dfarrell07, 15:42:30) * Another option is Karaf Decanter (which icbts worked on, I think) (dfarrell07, 15:44:06) * LuisGomez reports that framework he has can work with Decanter, anything with JMX, first via Decanter then to DB then to plotting logic, also supports clustering (dfarrell07, 15:44:43) * Keyword for this monitoring detects if in cluster or not, and automagically makes the correct number of graphs (dfarrell07, 15:45:26) * Dileep gives overview of various JVM profiling tools he has investigated, tools to supplement JMX via Decanter (dfarrell07, 15:47:35) * Use of flame graphs specifically seems very nice, Nikos has also pointed these out as a very good goal (dfarrell07, 15:48:01) * Dileep gives demo of flame graphs, show stacks, where time is spent, very helpful (dfarrell07, 15:49:10) * LuisGomez loves flame graph, would love to add, don't have today, wanted to start with simple things and add better profiling info like this (dfarrell07, 15:49:40) * LuisGomez asks Dileep how far along this work is, if it will be ready for automation before whitepaper testing (dfarrell07, 15:51:47) * If we don't have time to get this into automation, can at least do this profiling in Intel env (dfarrell07, 15:52:51) * LuisGomez and dfarrell07 would *love* to see this added to monitoring framework upstream so can be used in CI/many jobs (dfarrell07, 15:53:13) * First step is to get profiling tools installed in RelEng, then get them running in right JVMs, LuisGomez can point Dileep and past patches that do similar things for Decanter (dfarrell07, 15:54:32) * Discussion about keeping graph (now) vs logging to results in DB and creating many graphs via dashboard (future) (dfarrell07, 15:55:43) * Marcus has also been doing work to send data to external DB via Robot, and there's an intern and an Intel person working on dashboards, so may want to align (dfarrell07, 15:56:29) * Int/Test people are pretty damn busy with release atm, but a few Intel people may be able to get this done fairly quickly, good examples/help to follow (dfarrell07, 15:57:39) * Discussion about using Grafana, what TSDR is currently doing/not doing. LuisGomez thinks TSDR stuff is just for ODL DS, not generic info, like profiling results. (dfarrell07, 15:59:02) * icbts will be giving talk at ODL Summit about cool new Decanter features, we should attend (dfarrell07, 16:01:15) * More discussion about where data goes and gets plotted (dfarrell07, 16:02:18) * LuisGomez suggest Dileep go look at work Marcus is doing to put work in external DB, start with that path since this doesn't have to be done super-soon (dfarrell07, 16:03:13) * ACTION: LuisGomez to send intro email to right people, pointers to right patches, to Dileep about sending data to external DB (dfarrell07, 16:04:04) * Dileep's email address is dileep.ranganathan@intel.com (dfarrell07, 16:06:24) * LINK: http://sched.co/7RBf (icbts, 16:09:38) * LINK: http://sched.co/7RBf Link to icbts's ODL Summit Karaf++ talk we should all attend (dfarrell07, 16:13:16) * dfarrell07 gives NetVirt scale test updates, see #infos above (dfarrell07, 16:13:34) * Sanjib gives updates about OF testing, data collection, stats collection scripts/logic (dfarrell07, 16:15:59) * afredette knows about a newer Docker OVS image than the one dfarrell07 described above, will send pointer (dfarrell07, 16:16:25) * Discussion about clustering deploying scripts (dfarrell07, 16:17:29) * LINK: https://lists.opendaylight.org/pipermail/netvirt-dev/2016-July/000700.html Docker OVS image, new and recommended, OVS 2.5 (dfarrell07, 16:18:20) * LuisGomez will give TWS in 2 weeks about clustering lib (dfarrell07, 16:19:54) * Idea is to make it very easy for projects to write their on CSIT with clustering, lib extracted/provided by Int/Test (dfarrell07, 16:20:14) * Can also use this clustering lib in S3P tests, will start with single, see stable, then do in cluster (dfarrell07, 16:20:42) * Discussion about normal isolation in clustering tests, which things run in which VMs (dfarrell07, 16:22:34) * dfarrell07 fixed the meeting info for this meeting on GCal (dfarrell07, 16:25:12) * More discussion about future of clustering tests, work, timelines, plans (dfarrell07, 16:25:40) * LuisGomez points out that cluster just isn't very stable atm, so really need projects to fix that before can do too much more, Int/Test people need to keep testing/sending bugs (dfarrell07, 16:26:24) * dfarrell07 reminds all that Jamo raised a bunch of bugs, especially about clustering, after ONS this year. Projects seem to have dropped the ball on them, we need to go back and push for fixes. (dfarrell07, 16:28:06) Meeting ended at 16:28:10 UTC. People present (lines said) --------------------------- * dfarrell07 (67) * odl_meetbot (5) * icbts (4) * odp-gerritbot (2) * afredette (1) Generated by `MeetBot`_ 0.1.4