#fdio-vpp: fdio-vpp
Meeting started by DaveBarach at 15:00:28 UTC
(full logs).
Meeting summary
- 
  - mackonstan (mackonstan,
    15:02:59)
 
 
- CSIT (maciek reporting) (DaveBarach, 15:04:34)
  - Physical and virtual infrastructure
    updates (mackonstan,
    15:04:47)
- Vexxhost DC move almost done, last four servers
    will be moved from MTL1 to YUL1 tomorrow, and we are done with the
    phy machines move. (mackonstan,
    15:05:20)
- Mgmt/IPMI IPv4 addr renumbering to happen
    shortly, to put all hosts in the same subnet(s). (mackonstan,
    15:05:58)
- Open item1: OpenStack vRouter still used for
    accessing LF IT VM applications left behind in MTL1 (jenkins master,
    gerrit, etc) (mackonstan,
    15:06:48)
- Resolution1: LF IT VM apps will move to YUL1 in
    the next few weeks, and then all problems should go away.
    (mackonstan,
    15:07:05)
- Open item2: intermittent (much less frequent
    now after we went into full daily esca calls with involved parties)
    git fetch failures and jenkins connection resets. (mackonstan,
    15:07:18)
- https://secure.vexxhost.com/billing/viewticket.php?tid=NOB-607778&c=4Dp0GdHT
    (mackonstan,
    15:07:26)
- Resolution2: Continue daily 15min calls for
    situation review with involved parties, until all parties satisfies
    and min 2-day uninterrupted operation evident. (mackonstan,
    15:07:38)
- Test breakages: (mackonstan,
    15:09:40)
- NAT44ed multi-worker keep testing
    intermittently, less frequently after recent patch, but still vpp
    crashing. (mackonstan,
    15:09:51)
- Sporadic VPP crashes in get statistics.
    (mackonstan,
    15:10:15)
- Few other under investigation. (mackonstan,
    15:10:58)
- Work highlights: (mackonstan,
    15:11:18)
- CSIT in AWS - 2-node and 3-node tests running
    smoothly, ENA DPDK driver making VPP packets drop on tx. Moving
    ahead with Jenkins onboarding, will be publishing results for a
    subset of CSIT tests in CSIT-2106 report. (mackonstan,
    15:11:25)
- Merging VPP & Linux telemetry - VPP perfmon
    bundles, Linux bcc/bpf tracing tools, using OpenMetrics format for
    storage and post-processing. (mackonstan,
    15:12:06)
- Moving to json models for test oper data and
    results storage, querying and post processing. Would be good to hear
    from vpp-dev community what queries people would like execute
    against CST test result data e.g. over specific time period or for
    specific git patch period to say verify specific patch(set) impact
    on things. (mackonstan,
    15:13:19)
- Ongoing work to make TRex behaving as a
    deterministic and reliable traffic generator at high 100GbE
    rates. (mackonstan,
    15:13:31)
- Revamp of ipsec tests, as CSIT suffering from
    test suite overload (269 tests at last count). See Maciek recent
    patches for tests being axed, under review. (mackonstan,
    15:13:43)
- Generic effort to reduce number of tests,
    remove redundant packet path testing. See Maciek recent patches,
    under review. (mackonstan,
    15:13:56)
- Other CSIT-2106 work, see link (mackonstan,
    15:14:10)
- https://wiki.fd.io/view/CSIT/csit2106_plan
    (mackonstan,
    15:14:17)
 
 
- Host Stack(Florin) (DaveBarach, 15:14:41)
  - lots of patches in the last month (DaveBarach,
    15:14:59)
- improvements in session layer for
    connect/listen APIs - Lots more config knobs (DaveBarach,
    15:15:25)
- working to improve active-open
    performance (DaveBarach,
    15:15:44)
- moving active-opens to the first worker since
    the main thread tends to sleep a lot (DaveBarach,
    15:16:03)
- improve half-open connection tracking
    (DaveBarach,
    15:16:15)
- bunch of TCP cleanup, bulk buffer
    translation (DaveBarach,
    15:16:57)
- improvements in vcl test code, server
    (DaveBarach,
    15:17:12)
- now have a DTLS vcl test (DaveBarach,
    15:17:49)
 
 
- Documentation (Ole reporting) (DaveBarach, 15:18:19)
  - need to find a home for documentation, e.g. to
    auto-update main website docs (DaveBarach,
    15:19:12)
- dwallace: LFN has a license for
    readthedocs (DaveBarach,
    15:19:50)
- any community volunteers for maintaining /
    writing docs more than welcome (DaveBarach,
    15:20:35)
- dwallace: need to help e.g. Google find
    up-to-date docs (DaveBarach,
    15:21:12)
 
 
- Release Mgmt (Andrew) (DaveBarach, 15:21:28)
  - 21.06 RC1 in a few weeks (DaveBarach,
    15:21:58)
- 5/25 (Weds) will pull the release
    throttle (DaveBarach,
    15:22:13)
 
 
- Coverity (DaveBarach, 15:23:02)
  - look at list on github, broken out by
    owner/maintainer (DaveBarach,
    15:23:27)
- https://github.com/vpp-dev/vpp-coverity-report
    (DaveBarach,
    15:26:20)
- vppapigen "training wheels" to be removed in
    this release (DaveBarach,
    15:27:47)
- vppapigen added message status (experimental,
    production, etc)  to JSON (DaveBarach,
    15:28:27)
 
 
- Infra Status(DaveW) (DaveBarach, 15:29:05)
  - three intermittent false failures: punt tests
    fixed (DaveBarach,
    15:29:32)
- vpp device job fails when 2 jobs run / both try
    to reconfigure the i40e at the same time (DaveBarach,
    15:29:59)
- intermittent vcl / ldp make test failure on the
    arm platform (DaveBarach,
    15:30:18)
- "that one is driving me crazy..." (DaveBarach,
    15:30:38)
- reenabled Naginator to (temporarily) address
    Jenkins comms reset problems (DaveBarach,
    15:31:14)
- trying to avoid Vexxhost virtual-router
    bailing-wire / bubble-gum to improve network reliability
    (DaveBarach,
    15:32:06)
- DW spending hours/day updating vexxhost ticket
    w/ data (DaveBarach,
    15:32:40)
- proposal to use vpp instead of current  virtual
    router technology, early stage discussions (DaveBarach,
    15:33:24)
 
 
- make test (cont'd from last meeting) (DaveBarach, 15:38:41)
  - short-term, move tests back to centralized
    location (DaveBarach,
    15:39:34)
 
 
- node enqueue improvements (DaveBarach, 15:39:52)
  - currently: enqueues very fast when all pkts go
    to same destination (DaveBarach,
    15:40:19)
- rewrote vlib_node_enqueue_to_next(...) to use
    SIMD instrs (DaveBarach,
    15:40:51)
- significant change, but reduces 20 clocks to 2
    or 3 clocks in the general case (DaveBarach,
    15:41:46)
- handoff code in progress (DaveBarach,
    15:43:16)
- multiple tx queue support in progress
    (DaveBarach,
    15:43:33)
- not clear whether the two in-progress items
    will end up in 21.06 (DaveBarach,
    15:44:27)
- will try to combine handoff frames (DaveBarach,
    15:45:14)
- should improve high worker count scenarios
    where the number of tx queues is lower than the number of
    workers (DaveBarach,
    15:48:27)
- multiple places hash packets to queues. Want to
    create infra to handle the problem in a general way (DaveBarach,
    15:51:09)
 
Meeting ended at 15:54:22 UTC
(full logs).
Action items
  - (none)
People present (lines said)
  - DaveBarach (47)
- mackonstan (22)
- collab-meetbot (5)
- dmarion (0)
Generated by MeetBot 0.1.4.