Thursday, 2022-10-27

*** shoragan <shoragan!~shoragan@user/shoragan> has quit IRC (*.net *.split)02:20
*** shoragan <shoragan!~shoragan@user/shoragan> has joined #cip02:22
*** rajm <rajm!~robert@cpc126990-macc4-2-0-cust43.1-3.cable.virginm.net> has joined #cip06:35
*** prabhakarlad <prabhakarlad!~prabhakar@pc.renesas.eu> has joined #cip07:38
*** prabhakarlad <prabhakarlad!~prabhakar@pc.renesas.eu> has quit IRC (Ping timeout: 244 seconds)08:27
*** prabhakarlad <prabhakarlad!~prabhakar@pc.renesas.eu> has joined #cip08:57
*** uli <uli!~uli@d5369b80.access.ecotel.net> has joined #cip10:50
*** masami <masami!~masami@FL1-125-198-44-131.tky.mesh.ad.jp> has joined #cip11:42
*** tobsch <tobsch!~quassel@195.145.170.188> has joined #cip11:46
*** pave1 <pave1!~pavel@jabberwock.ucw.cz> has joined #cip11:58
*** jki <jki!~jki@195.145.170.157> has joined #cip12:00
jkihi there!12:00
iwamatsuhi12:00
ulihello12:00
masamihello12:00
alicefmHi12:00
patersonc[m]Hello12:00
pave1hi12:00
jki#startmeeting CIP IRC weekly meeting12:02
collab-meetbot`Meeting started Thu Oct 27 12:02:15 2022 UTC and is due to finish in 60 minutes.  The chair is jki. Information about MeetBot at http://wiki.debian.org/MeetBot.12:02
collab-meetbot`Useful Commands: #action #agreed #help #info #idea #link #topic #startvote.12:02
collab-meetbot`The meeting name has been set to 'cip_irc_weekly_meeting'12:02
*** collab-meetbot` changes topic to " (Meeting topic: CIP IRC weekly meeting)"12:02
jki#topic AI review12:02
*** collab-meetbot` changes topic to "AI review (Meeting topic: CIP IRC weekly meeting)"12:02
jki1. Add qemu-riscv to cip-kernel-config - patersonc12:02
patersonc[m]No updates12:02
jkitoo bad - we are close to zero AI ;)12:03
jkiany other AIs?12:03
patersonc[m]Sorry!12:04
jkinp12:04
jkithen moving on...12:04
jki312:04
jki212:04
jki112:04
jki#topic Kernel maintenance updates12:04
*** collab-meetbot` changes topic to "Kernel maintenance updates (Meeting topic: CIP IRC weekly meeting)"12:04
ulireviewing 5.10.15012:05
pave1Reviewing 5.10.150, 151.12:05
masamiThere is 20 CVEs reported. These are not so critical vulnerabilities.12:05
iwamatsuI reviewed 5.10.150.12:05
jkiany other topic on maintenance?12:08
jkithen: moving on...12:10
jki312:10
jki212:10
jki112:10
pave1Busy week with huge reviews from -rc1, sorry.12:10
jkiok :)12:10
pave1We can move on.12:10
jki#topic Kernel testing12:10
*** collab-meetbot` changes topic to "Kernel testing (Meeting topic: CIP IRC weekly meeting)"12:10
alicef_patersonc[m]: I wrote on #lava-docker about the lava security problem12:11
patersonc[m]Thanks, I saw it recently. We really need to upgrade our LAVA version anyway, this is a good kick12:11
alicef_oh I just see your replay12:11
iwamatsunice12:11
patersonc[m]I'll try and start investigating in the next few weeks, I just need to get a few internal things sorted first12:12
alicef_thanks pave1 for sending the email with the warnings checks12:12
patersonc[m]+112:13
pave1You are welcome :-).12:13
pave1For the results, I believe good start would be reporting failure whenever test changes.12:14
pave1I guess that will simply result on smc fails on qemu? We can disable those as a next step.12:14
pave1Seeing one fail and investigating is better than having to go through all results every time to see if something maybe failed somewhere.12:15
alicef_disable it only for qemu ?12:15
pave1alicef: Yes. Or.. does it cause problems elesewhere?12:15
alicef_I don't think so12:16
pave1(Or it can be still run and result ignored, or something, if that's easier).12:17
patersonc[m]>For the results, I believe good start would be reporting failure whenever test changes.12:18
patersonc[m]This would mean setting up a database to keep track of previous test results etc.12:18
patersonc[m]Then comparing previous result to the new result, and only fail if it's changed from pass->fail12:18
patersonc[m]Is this what you mean?12:18
patersonc[m] * > For the results, I believe good start would be reporting failure whenever test changes.12:18
patersonc[m]This would mean setting up a database to keep track of previous test results etc.12:18
patersonc[m]Then comparing previous result to the new result, and only fail if it's changed from pass->fail12:18
patersonc[m]Is this what you mean?12:18
pave1No, not really.12:18
pave1"If the target is qemu && test is smc then ignore the test result".12:19
patersonc[m]Yes, but we get smc failures on other boards as well12:19
pave1Ok, can I get an exampel?12:19
patersonc[m]https://gitlab.com/cip-project/cip-kernel/linux-cip/-/jobs/3141003858#L26212:20
pave1Can we run smc on corresponding upstream kernel, to verify we did not break anything, and then probably disable smc test?12:21
pave1Thanks for example, I'll take a look.12:21
patersonc[m]Check the email I sent on Tuesday for more links12:22
* patersonc[m] checks for more links now - 2 secs12:24
patersonc[m]Upstream: https://linux.kernelci.org/test/plan/id/635a180f43c30ac29ce7db6f/12:24
patersonc[m]CIP: https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/jobs/3224759608#L113212:25
patersonc[m]Both on the hihope-rzg2m board12:25
patersonc[m]Same CVE failure12:25
patersonc[m](obviously different kernels)12:25
pave1If the test fails even on 6.1-rc2, it will probably fail in 5.10-stable and 5.10-cip, too.12:26
pave1Let's ignore the test.12:26
pave1Going through the emails, you said: "I'd still need this job to return a green tick though if the actual job runs successfully, even if there are failed test cases.12:27
iwamatsuBut xilinx MPSoC is no issue. https://lava.ciplatform.org/scheduler/job/771035#L85912:27
pave1Actually, I believe what we should do.12:27
pave1This is not kernel issue, this is subtly broken hardware.12:27
pave1Plus it is "security" and people love to say "this is critical"12:28
pave1when it is not.12:28
pave1We have security team. Could we simply ask our security team to investigate that?12:28
pave1...best in cooperation with hardware vendors. :-)12:29
pave1Why do you want to put green ticks on failing tests?12:29
pave1Lets simply use "green" -> nothing to see here.12:29
pave1"red" -> someone needs to look at the results.12:29
pave1I guess it is normally me looking at the results, and I escalate if I can't understand that, but don't make me examine logs for 12 targets when there's nothing to see there.12:30
patersonc[m]This was referring to if there are tests that always fail, such as those smc tests12:31
pave1Ok, lets disable those on platforms where they always fail.12:32
patersonc[m]Is a view like this better?12:32
patersonc[m]https://linux.kernelci.org/test/job/cip/branch/linux-4.4.y-cip/kernel/v4.4.302-cip70-98-g7f7838c92740f/12:32
pave1Not really. When I tried to investigate a failure, I got into maze of results..12:34
pave1...https://linux.kernelci.org/test/plan/id/62d01083f0ebb14854a39bde/12:34
pave1...which seems to be caused by flakey target failing randomly.12:34
patersonc[m]But flakey targets is no different for kernelci compared to cip right?12:35
pave1...but because results are sorted by test (not target), it is not obvious to see.12:35
patersonc[m]okay12:35
patersonc[m]This view should show the targets that failed: https://linux.kernelci.org/test/job/cip-gitlab/branch/ci%2Fpavel%2Flinux-test/kernel/v5.10.147-cip18-11-ge6d27ea102c3b/plan/baseline/12:36
pave1It works as long as they are no flakey things. We are happy to have none in the gitlab.12:36
patersonc[m]s/failed/regressed/12:36
alicef_about the asus chromebook failures I asked on kernelci about that. and they was looking into such problems. not sure if they did something.12:36
alicef_actually the best would be to open a issue on kernelci about boards that fails often, as it can be relevant also for other kernel tree12:37
pave1So the way I use test branch (pushing 5.10, 4.19 and 4.4 into it) leads into really confusing stuff on kernelci.12:38
patersonc[m]Could you use a different branch for each kernel version?12:39
patersonc[m]Then the automated regression tracking would work12:40
pave1I'll need to jump on a branch, too, during bisect for example.12:41
pave1I'm afraid I'll get the regression tracking confused..12:41
jkithen you need a separate branch for debugging, vs. one per kernel that is moving forward only and tracks regressions12:43
pave1Yes, that's current setup.12:44
pave1Actually we have 4.4-st-rc and 4.4-st because more debugging is happening there.12:44
patersonc[m]Taking a step back a bit, why don't we change our GitLab CI approach to be more for development - "instant" CI, and mainly focus on kernel builds and boot tests.12:45
patersonc[m]Then use KernelCI for the full (pre)release testing with full build/test coverage.12:45
patersonc[m]GitLab CI can then provide simple green/red statuses, and KernelCI can track regressions properly etc.12:45
patersonc[m]Best of both worlds12:45
patersonc[m]* builds and (only) boot tests.12:46
patersonc[m]Then rather spend resources trying to make our gitlab CI setup do the same thing as kernelci, we can spend them on improving kernelci for all, and on improving things like platform stability in the labs12:46
alicef_updating lava also12:47
pave1Kernelci is quite confusing to me. Gitlab CI is a bit better, but hiding failures behind green ticks is not nice.12:48
pave1We can split testing between kernelci and gitlab anyway you want, if you take over interpretting the results.12:49
pave1I can simply ask over email, and testing team says "okay, that kernel looks good to us" or "hey there's bug you should debug".12:50
patersonc[m]Is anyone else in the kernel team looking at test results on KernelCI/gitlab?12:51
ulii don't, usually12:52
iwamatsuI am checking sometime.12:52
jkiis the number of reports we get on cip-dev useful then, or can it be made useful?12:54
pave1I don't find those reports useful.12:57
patersonc[m]From each build triggered on kernelci we get 1 email reporting on the build status, and 1 email per test suite.12:58
patersonc[m]The test emails have a section at the top which tells you which boards had regressions. There is a link to click in order to see the logs etc.12:58
iwamatsuIs there a lot of unnecessary information?12:58
patersonc[m]There's a lot more information if you scroll down, but it's direct links to each board that failed, rather then the summary link at the top13:00
patersonc[m]So you can either click the summary link and navigate the website, or click on the specific links. All useful depending on your choice13:01
alicefmAlso already KernelCI works by regression and notify if a previous working test failed13:01
pave1Well, there's buch of reports from kernelci on cip-dev from Oct 26.13:02
alicefmSMC problem got notified because on the previous 4.4-CIP builds it was not failings13:03
pave1It reports lot of failures including regressions, but I doubt there's a real regression in 5.10-rc tree.13:03
alicefmPavel that because KernelCI is sending a build report and a board/regression reports13:03
pave1Could we get a red cross whenever gitlab detects something that can be debugged, please?13:04
alicefmBuild report are always sent, board/regression reports are sent with problems13:05
pave1I'm trying to get gitlab useful, first. kernelci on 5.10 will require more work.13:06
patersonc[m]Okay13:08
patersonc[m]Please take a look at my email from tuesday and comment on the MR etc. Then GitLab will be red if individual test cases fail13:08
pave1Ok.13:08
pave1Can we get smc disabled on problematic boards?13:09
patersonc[m]I'll have to check if any of the boards pass13:10
patersonc[m]May be easier to disable it alltogether if we're not interested in the results13:10
patersonc[m]But it's not just smc. We'll get failures in all the ltp tests as well13:10
pave1Can we disable those ltp cases that always fail?13:11
iwamatsuThese can be controlled with LAVA metadata.13:11
pave1But I don't see ltp failures.13:12
pave1I picked up random example:13:12
pave1https://gitlab.com/cip-project/cip-testing/linux-stable-rc-ci/-/jobs/323583970413:12
pave1and only see 0_spectre-meltdown-checker-test.CVE-2018-3640 [fail] there.13:12
patersonc[m]That test job only ran boot and smc tests13:12
pave1When are we running ltp tests?13:13
patersonc[m]On rc testing13:14
patersonc[m]Here's an example: https://gitlab.com/cip-project/cip-kernel/linux-cip/-/jobs/318628197313:14
pave1Fun. That will need some more work.13:15
pave1But I'd still proceed with the red cross, so we know we have work to do.13:15
patersonc[m]Okay13:15
pave1Thank you!13:16
patersonc[m]Anyway, the MR is there to comment on.13:16
patersonc[m]Let's move on because we're already well over time13:16
jkiindeed :)13:16
jki313:16
jki213:16
jki113:16
jki#topic AOB13:16
*** collab-meetbot` changes topic to "AOB (Meeting topic: CIP IRC weekly meeting)"13:16
jkianything else to discuss today?13:17
jki313:18
jki213:18
jki113:18
jki#endmeeting13:18
collab-meetbot`Meeting ended Thu Oct 27 13:18:09 2022 UTC.  Information about MeetBot at http://wiki.debian.org/MeetBot . (v 0.1.4)13:18
collab-meetbot`Minutes:        http://ircbot.wl.linuxfoundation.org/meetings/cip/2022/10/cip.2022-10-27-12.02.html13:18
collab-meetbot`Minutes (text): http://ircbot.wl.linuxfoundation.org/meetings/cip/2022/10/cip.2022-10-27-12.02.txt13:18
collab-meetbot`Log:            http://ircbot.wl.linuxfoundation.org/meetings/cip/2022/10/cip.2022-10-27-12.02.log.html13:18
*** collab-meetbot` changes topic to "Civil Infrastructure Platform Project. CIP mailing list at https://lists.cip-project.org/g/cip-dev | CIP kernel meeting every Thursday at 12:00 UTC | Find the meeting logs at https://ircbot.wl.linuxfoundation.org/meetings/cip/ and chat logs at https://ircbot.wl.linuxfoundation.org/logs/%23cip/""13:18
jkithen: thank you all!13:18
ulithanks13:18
alicefmThank you13:18
masamithank you13:18
iwamatsuthank you13:18
jkipatersonc[m]: any news from the rzfive and ldconfig?13:18
pave1Thank you!13:19
patersonc[m]jki: Not yet, sorry. It is being looked at though13:23
*** uli <uli!~uli@d5369b80.access.ecotel.net> has left #cip (Leaving)13:28
*** jki <jki!~jki@195.145.170.157> has quit IRC (Quit: Leaving)14:32
*** prabhakarlad <prabhakarlad!~prabhakar@pc.renesas.eu> has quit IRC (Quit: Client closed)15:09
*** tobsch <tobsch!~quassel@195.145.170.188> has quit IRC (Ping timeout: 244 seconds)15:14
*** tobsch <tobsch!~quassel@ip5f5bfaca.dynamic.kabel-deutschland.de> has joined #cip16:38
*** tobsch <tobsch!~quassel@ip5f5bfaca.dynamic.kabel-deutschland.de> has quit IRC (Ping timeout: 244 seconds)16:46
*** tobsch <tobsch!~quassel@2a02:810d:4bc0:2020:7b79:e9a7:3687:726f> has joined #cip16:46
*** tobsch <tobsch!~quassel@2a02:810d:4bc0:2020:7b79:e9a7:3687:726f> has quit IRC (Ping timeout: 255 seconds)17:02
*** tobsch <tobsch!~quassel@2a02:810d:4bc0:2020:e1b:11c:aca3:c3c8> has joined #cip17:13
*** tobsch <tobsch!~quassel@2a02:810d:4bc0:2020:e1b:11c:aca3:c3c8> has quit IRC (Ping timeout: 250 seconds)17:31
*** prabhakarlad <prabhakarlad!~prabhakar@pc.renesas.eu> has joined #cip19:05
iwamatsubry449dj20:39
iwamatsuomg20:41
*** prabhakarlad <prabhakarlad!~prabhakar@pc.renesas.eu> has quit IRC (Quit: Client closed)21:05
*** prabhakarlad <prabhakarlad!~prabhakar@pc.renesas.eu> has joined #cip21:09
*** rajm <rajm!~robert@cpc126990-macc4-2-0-cust43.1-3.cable.virginm.net> has quit IRC (Ping timeout: 244 seconds)21:52

Generated by irclog2html.py 2.17.3 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!