Re: regression tracking (Re: Linux 2.6.21)

From: Oleg Verych
Date: Sat Jun 16 2007 - 00:51:39 EST


[I've added Herbert as former kernel team member in the debian(AFAIK),
sorry, if i'm wrong and you have no opinion on that, Herbert.]

On Sat, Jun 16, 2007 at 04:55:16AM +0200, Adrian Bunk wrote:
> On Sat, Jun 16, 2007 at 03:32:36AM +0200, Oleg Verych wrote:
> > On Sat, Jun 16, 2007 at 01:42:02AM +0200, Adrian Bunk wrote:
> > > On Fri, Jun 15, 2007 at 04:20:57PM -0700, Linus Torvalds wrote:
> > > >
> > > >
> > > > On Thu, 14 Jun 2007, Oleg Verych wrote:
> > > > >
> > > > > I'm seeing this long (198) thread and just have no idea how it has
> > > > > ended (wiki? hand-mailing?).
> > > >
> > > > I'm hoping it's not "ended".
> > > >
> > > > IOW, I really don't think we _resolved_ anything, although the work that
> > > > Adrian started is continuing through the wiki and other people trying to
> > > > track regressions, and that was obviously something good.
> > > >
> > > > But I don't think we really know where we want to take this thing in the
> > > > long run. I think everybody wants a better bug-tracking system, but
> > > > whether something that makes people satisfied can even be built is open.
> > > > It sure doesn't seem to exist right now ;)
> > >
> > > The problem is not the bug tracking system, be it manual tracking in a
> > > text file or a Wiki or be it in Bugzilla or any other bug tracking
> > > system.
> > >
> > > One problem is the lack of experienced developers willing to debug bug
> > > reports.
> >
> > *debug*
> >
> > I hope you saw what subject i've chosen to bring this discussion back.
> > Yes, "tracking", as the first brick for big wall.
>
> Tracking regressions is not a real problem.
> Especially since it doesn't require much technical knowledge.

I've tried to express different point of view. We have different ones {0}.
Thus, no comments.

> > Your arguments about developers and users, you've said already, but i've
> > asked different questions, have i?
> >
> > Lets look on regular automatic report, like this one:
> >
> > Message-ID: <E1Hz5HK-0007uB-MO@xxxxxxxxxxxxxxxxx>
> > Xref: news.gmane.org gmane.linux.debian.devel.general:116248
> > Archived-At: <http://permalink.gmane.org/gmane.linux.debian.devel.general/116248
> >
> > And what we see? Basic packages, like ``dpkg'', ``grub'', ``mc'' are
> > in the list, requesting help. And as you can see for quite some time.
> > And it's *OK*, because distribution is working, development is going on.
> > Sometimes slowly, sometimes with delays.
>
> I sent weekly regression reports.
> Not automatically generated but manually - but that doesn't matter.
>
> The problem is that sending reports itself does not fix anything.

...{1}

> > > But what really annoyed me was the missing integration of regression
> > > tracking into the release process, IOW how _you_ handled the regression
> > > lists.
> >
> > So, _tracking_ or _debugging_?
>
> _debugging_ (can be indirectly by poking other people to do debugging)
>
> > > If we want to offer something less of a disaster than 2.6.21, and if we
> > > want to encourage people to start and continue testing -rc kernels, we
> > > must try to fix as many reported regressions as reasonably possible.
> >
> > *tracking*
>
> no, *debugging*
>
> I tracked regressions for the 2.6.21 disaster, and the not debugged
> regressions that I had tracked are exactly where we should be better.

...{2}

> >...
> > > This means going through every single point in the regression list
> > > asking "Have we tried everything possible to solve this regression?".
> > > There are very mysterious regressions and there are regressions that
> > > might simply be reported too late. But if at the time of the final
> > > release 3 week old regressions hadn't been debugged at all there's
> > > definitely room for improvement. And mere mortals like me reminding
> > > people is often not enough, sometimes an email by Linus Torvalds himself
> > > asking a patch author or maintainer to look after a regression might be
> > > required.
> >
> > *social* (first approximation)
>
> Yes.
>
> > That's a social problem, just like Debian loosing good kernel team members.
>
> Different social problem.

The term ``like'' here means people are not able/willing to do work, they
might/can do. And cause of it is *social*, not technical. {1},{2} are
results of that problem/behavior. But according to {0}, you think,
differently.

> > For example you feel, that you've wasted time. But after all, if you've
> > came up with some kind of tool, everybody else could take it. No
> > problems, useful ideas must and will evolve. But _ideally_ this must not be
> > from ground zero every time. _Ideally_ from technical, not personal
> > point of view ;).
>
> As I expected, someone else has picked up regression tracking.
> And other from what you claim, this did not require any kind of tool.

So you expected good, doing bad. ``Bad'' means bringing pointless flame
about what everybody should do, without constructive approach like: "OK,
i can't do it due to my POV{0}, useless manual work. Everybody willing to
bring another way of dealing with it is welcome."

Your first reply:

"And it's not that Linus started developing the Linux kernel with writing
git, the first 10 years of Linux development were without any SCM." {3}

to my note about, that you are not hurry anywhere, that after all that
years of Open Source and Free Software development you are not trying
to deal with such important thing like regression/bug tracking in
***organized way***, is rather pointless.

> > That's why people in Debian have started *team* maintenance with alioth.
>
> The problem for the Linux kernel is that for a better bug handling you'd
> need people willing to learn other people's code and to do the hard work
> of debugging bug reports.

... {0}++

Do you know, for example, why i'm not making my "hacker's career"
doing that?

1. because i ended up with lynx, slrn, mutt, emacs-nox. Including
"zarro bogs found" kind of thing and other "userspace suck" problems.
2. i have no way to know if something *really* broken, unless it right
on my hardware

This all unlike Debian BTS using reportbug, where you have basic
information, mbox format messages for easy "mutt -f", and other funny
things, real maintainers aware of (i'm trying to know, learn about).

Thus organized, non brain-damaged way of reporting and tracking is the
key issue.

> E.g. writing a new filesystem is simply much more fun than learning and
> debugging other people's code in some old filesystem.

It's like in {3} -- i don't like it (personally), so i'm going along.

That's wrong and counter productive, as i've tried to explain.

> Talking about "team maintenance" sounds nice, but the problem in the
> kernel starts with code that has zero maintainers.

Floppy went pretty fine, before it was started to be maintained, and
you know that. But you also told that unmaintained and not-working are
different things.

Thus, if that just happen to break, well reports are welcome, and if
long time run will show, that user count is small, so let be it.
Nothing is long forever.

Positive side i think obvious, because Linus have found ext2 bug back
under the New Year tree (AFAIK), Thomas did his best on timers, etc.

For more productivity score system must be implemented and synchronized[0]
with distributions. Only after that *noise* filter, you may claim
importance of problems. Otherwise, you must know how noise affects human
beings.

(In the prev. e-mail i've mentioned such effort from one of the DDs:
[0] Message-ID: <handler.s.C.118074292516912.transcript@xxxxxxxxxxxxxxx>
Xref: news.gmane.org gmane.linux.debian.devel.kernel:29426
Archived-At: <http://permalink.gmane.org/gmane.linux.debian.devel.kernel/29426>
)

> And if there's already a maintainer, it's unlikely that he'll not
> accept patches from some new person debugging bug reports. But how to
> find people who will become good maintainers?

... {0}++

> > Unfortunately problems with individuals in big machine with bad people,
> > got randomly elected, can't be solved (IMHO). Even LKML's rule "patches
> > are welcome", that is very technical, thus good, doesn't work there.
>
> Debian has it's own problems, Linux kernel development at least has a
> working structure for getting decisions and regular releases.
>
> > Finally, read the end of 2.6.21 release message ;)
> >
> > > And a low hanging fruit to improve the release would be if you could
> > > release one last -rc, wait for 48 hours, and then release either this
> > > -rc unchanged as -final or another -rc (and wait another 48 hours).
> > > There were at least two different regressions people ran into in 2.6.21
> > > who successfully tested -rc7.
> >
> > What about Linus' tree is a development tree, Andrew's one is a "crazy
> > development one" (quoting Linus)?
> >
> > What about open (web page still exists) position on bug manager in
> > Google?
> >
> > What about *volunteers* working from both developer's and user's
> > sides? What about "release is a reward" for everybody?
>
> People aren't that dumb that some empty words like "release is a reward"
> would change anything.

So, distro kernel teams, making .21 available to their user are ones?
That's simply pointless.

> > Balanced eco-system will oscillate. Be it .19(---++), .20(-++++),
> > .21(----+) *relese*. That's natural, unless pushed to extremes.
> >
> > I think, i can trust Linus in it, and you are welcome too.
> >
> > *tools*
> >
> > That's why i'm talking about tools, and started to discuss them.
> >
> > My last try: reportbug (there's "-ng" one also), Debian BTS.
> >
> > Adrian, what can/must be done to adopt them? If not, your experience may
> > provide information about "why?" (re-consider my first mail about
> > background, please).
>
> Bug tracking for the kernel is more or less working.
> The main problem is getting people to debug bug reports.
>
> We need the main problem fixed, not a different tool in an area that is
> not the main problem.

I see (this repetition). Maybe i've managed to express my POV, that it
can be seeing more cleanly.
____
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/