Re: [PATCH v3 2/2] soc: mediatek: add mtk-devapc driver

From: Neal Liu
Date: Tue Jul 28 2020 - 23:16:46 EST


On Wed, 2020-07-29 at 10:22 +0800, Chun-Kuang Hu wrote:
> Neal Liu <neal.liu@xxxxxxxxxxxx> æ 2020å7æ29æ éä äå10:10åéï
> >
> > Hi Chun-Kuang,
> >
> > On Tue, 2020-07-28 at 23:35 +0800, Chun-Kuang Hu wrote:
> > > Hi, Neal:
> > >
> > > Neal Liu <neal.liu@xxxxxxxxxxxx> æ 2020å7æ28æ éä äå11:52åéï
> > > >
> > > > Hi Chun-Kuang,
> > > >
> > > > On Mon, 2020-07-27 at 22:47 +0800, Chun-Kuang Hu wrote:
> > > > > Hi, Neal:
> > > > >
> > > > > Neal Liu <neal.liu@xxxxxxxxxxxx> æ 2020å7æ27æ éä äå11:06åéï
> > > > > >
> > > > > > Hi Chun-Kuang,
> > > > > >
> > > > > > On Fri, 2020-07-24 at 23:55 +0800, Chun-Kuang Hu wrote:
> > > > > > > Hi, Neal:
> > > > > > >
> > > > > > > Neal Liu <neal.liu@xxxxxxxxxxxx> æ 2020å7æ24æ éä äå2:55åéï
> > > > > > > >
> > > > > > > > Hi Chun-Kuang,
> > > > > > > >
> > > > > > > > On Fri, 2020-07-24 at 00:32 +0800, Chun-Kuang Hu wrote:
> > > > > > > > > Hi, Neal:
> > > > > > > > >
> > > > > > > > > Neal Liu <neal.liu@xxxxxxxxxxxx> æ 2020å7æ23æ éå äå2:11åéï
> > > > > > > > > >
> > > > > > > > > > Hi Chun-Kuang,
> > > > > > > > > >
> > > > > > > > > > On Wed, 2020-07-22 at 22:25 +0800, Chun-Kuang Hu wrote:
> > > > > > > > > > > Hi, Neal:
> > > > > > > > > > >
> > > > > > > > > > > Neal Liu <neal.liu@xxxxxxxxxxxx> æ 2020å7æ22æ éä äå11:49åéï
> > > > > > > > > > > >
> > > > > > > > > > > > Hi Chun-Kuang,
> > > > > > > > > > > >
> > > > > > > > > > > > On Wed, 2020-07-22 at 07:21 +0800, Chun-Kuang Hu wrote:
> > > > > > > > > > > > > Hi, Neal:
> > > > > > > > > > > > >
> > > > > > > > > > > > > Neal Liu <neal.liu@xxxxxxxxxxxx> æ 2020å7æ21æ éä äå12:00åéï
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +/*
> > > > > > > > > > > > > > + * mtk_devapc_dump_vio_dbg - get the violation index and dump the full violation
> > > > > > > > > > > > > > + * debug information.
> > > > > > > > > > > > > > + */
> > > > > > > > > > > > > > +static bool mtk_devapc_dump_vio_dbg(struct mtk_devapc_context *ctx, u32 vio_idx)
> > > > > > > > > > > > > > +{
> > > > > > > > > > > > > > + u32 shift_bit;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + if (check_vio_mask(ctx, vio_idx))
> > > > > > > > > > > > > > + return false;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + if (!check_vio_status(ctx, vio_idx))
> > > > > > > > > > > > > > + return false;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + shift_bit = get_shift_group(ctx, vio_idx);
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + if (sync_vio_dbg(ctx, shift_bit))
> > > > > > > > > > > > > > + return false;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + devapc_extract_vio_dbg(ctx);
> > > > > > > > > > > > >
> > > > > > > > > > > > > I think get_shift_group(), sync_vio_dbg(), and
> > > > > > > > > > > > > devapc_extract_vio_dbg() should be moved out of vio_idx for-loop (the
> > > > > > > > > > > > > loop in devapc_violation_irq()) because these three function is not
> > > > > > > > > > > > > related to vio_idx.
> > > > > > > > > > > > > Another question: when multiple vio_idx violation occur, vio_addr is
> > > > > > > > > > > > > related to which one vio_idx? The latest happened one?
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > Actually, it's related to vio_idx. But we don't use it directly on these
> > > > > > > > > > > > function. I think below snip code might be better way to understand it.
> > > > > > > > > > > >
> > > > > > > > > > > > for (...)
> > > > > > > > > > > > {
> > > > > > > > > > > > check_vio_mask()
> > > > > > > > > > > > check_vio_status()
> > > > > > > > > > > >
> > > > > > > > > > > > // if get vio_idx, mask it temporarily
> > > > > > > > > > > > mask_module_irq(true)
> > > > > > > > > > > > clear_vio_status()
> > > > > > > > > > > >
> > > > > > > > > > > > // dump violation info
> > > > > > > > > > > > get_shift_group()
> > > > > > > > > > > > sync_vio_dbg()
> > > > > > > > > > > > devapc_extract_vio_dbg()
> > > > > > > > > > > >
> > > > > > > > > > > > // unmask
> > > > > > > > > > > > mask_module_irq(false)
> > > > > > > > > > > > }
> > > > > > > > > > >
> > > > > > > > > > > This snip code does not explain any thing. I could rewrite this code as:
> > > > > > > > > > >
> > > > > > > > > > > for (...)
> > > > > > > > > > > {
> > > > > > > > > > > check_vio_mask()
> > > > > > > > > > > check_vio_status()
> > > > > > > > > > >
> > > > > > > > > > > // if get vio_idx, mask it temporarily
> > > > > > > > > > > mask_module_irq(true)
> > > > > > > > > > > clear_vio_status()
> > > > > > > > > > > // unmask
> > > > > > > > > > > mask_module_irq(false)
> > > > > > > > > > > }
> > > > > > > > > > >
> > > > > > > > > > > // dump violation info
> > > > > > > > > > > get_shift_group()
> > > > > > > > > > > sync_vio_dbg()
> > > > > > > > > > > devapc_extract_vio_dbg()
> > > > > > > > > > >
> > > > > > > > > > > And my version is identical with your version, isn't it?
> > > > > > > > > >
> > > > > > > > > > Sorry, I did not explain it clearly. Let's me try again.
> > > > > > > > > > The reason why I put "dump violation info" between mask & unmask context
> > > > > > > > > > is because it has to stop interrupt first before dump violation info,
> > > > > > > > > > and then unmask it to prepare next violation.
> > > > > > > > > > These sequence guarantee that if multiple violation is triggered, we
> > > > > > > > > > still have information to debug.
> > > > > > > > > > If the code sequence in your version and multiple violation is
> > > > > > > > > > triggered, there might be no any information but keeps entering ISR.
> > > > > > > > > > Finally, system might be abnormal and watchdog timeout.
> > > > > > > > > > In this case, we still don't have any information to debug.
> > > > > > > > >
> > > > > > > > > I still don't understand why no information to debug. For example when
> > > > > > > > > vio_idx 5, 10, 15 has violation,
> > > > > > > > > You would mask vio_idx 5 to get information, but vio_idx 10, 15 does
> > > > > > > > > not mask yet.
> > > > > > > > > In your words, when vio_idx 10, 15 not mask, you would not get any
> > > > > > > > > debug information when you process vio_idx 5.
> > > > > > > > >
> > > > > > > > > In my version, I would clear all status, why keeps entering ISR?
> > > > > > > >
> > > > > > > > Think about this case, if someone tries to dump "AAA" module's register.
> > > > > > > > It would keep read reg base, base+0x4, base+0x8, ...
> > > > > > > > All these registers are in the same slave, which would be same vio_idx.
> > > > > > > > (Take vio_idx 5 as example)
> > > > > > > > In this case, vio_idx 5 will keep triggering interrupt. If you did not
> > > > > > > > do "dump violation info" between mask & unmask, you cannot get any
> > > > > > > > violation info until the last interrupt being handled.
> > > > > > > > Normally, system will crash before last interrupt coming.
> > > > > > >
> > > > > > > You have said that first vio_addr would be kept until it's 'handled'.
> > > > > > > So the first vio_addr reg_base would be kept even though other
> > > > > > > violation happen. And I could handle (clear status and dump info) it
> > > > > > > then vio_addr would next violation's address. I'm confused with your
> > > > > > > statement. If AAA is dumping register of vio_idx 5, BBB is dumping
> > > > > > > register of vio_idx 10, CCC is dumping register of vio_idx 15, I think
> > > > > > > you should mask all vio_idx not only one. So the code would be
> > > > > > >
> > > > > > > for all vio_idx {
> > > > > > > mask_module_irq(true)
> > > > > > > }
> > > > > > >
> > > > > > > devapc_extract_vio_dbg()
> > > > > > >
> > > > > > > for all vio_idx {
> > > > > > > clear_vio_status()
> > > > > > > mask_module_irq(false)
> > > > > > > }
> > > > > > >
> > > > > >
> > > > > > I'm also consider this solution and I think it's much better to
> > > > > > understand hardware behavior.
> > > > > >
> > > > > > devapc_dump_vio_dbg()
> > > > > > {
> > > > > > while(1) {
> > > > > > // might have multiple shift_bit raised
> > > > > > shift_bit = get_shift_group()
> > > > > > if (shift_bit >= 0 && shift bit <= 31)
> > > > > > sync_vio_dbg(shift_bit)
> > > > > > extract_vio_dbg()
> > > > >
> > > > > According to your statement, when multiple violation occur, only the
> > > > > first one is kept, others are dropped. I think we just need to dump
> > > > > debug info once.
> > > > >
> > > > > Because only one violation information would be kept, why not only one
> > > > > group (equal to no group)?
> > > > >
> > > > > Regards,
> > > > > Chun-Kuang.
> > > >
> > > > Let's me give you an example of devapc design.
> > > > vio_idx: 0, 1, 2 -> group 0 (shift_bit: 0)
> > > > vio_idx: 3, 4, 5 -> group 1 (shift_bit: 1)
> > > > ...
> > > >
> > > > Each group violation will keep one violation (the first one). If vio_idx
> > > > 0 is triggered first, vio_idx 1 is triggered next, then group 0 will
> > > > just keep vio_idx 0 violation info.
> > > > If vio_idx 2 is triggered first, vio_idx 3 is triggered next, then group
> > > > 0 will keep vio_idx 2 violation info, group 1 will keep vio_idx 3's.
> > > >
> > > > We have to scan all groups and dump everything we have.
> > > > Thanks !
> > > >
> > >
> > > Could we let all vio_idx be group 0 so that we could just sync one
> > > group? It's bad to spend too much time in irq handler.
> > > When we set pd_vio_shift_sel_reg, it seems we could set multiple group
> > > together, couldn't it?
> > >
> > > Regards,
> > > Chun-Kuang.
> > >
> >
> > No, Which group vio_idx belongs to is determined by hardware. Software
> > cannot change its group.
> > There is very low possibility that multiple groups has violation at the
> > same time, so it would not spend much time to handle it.
> > It also cannot shift multiple groups at the same time since there is
> > only one vio_info(rw, vio_addr, master_id, ...) exist at a time.
> > devapc_extract_vio_dbg() function is doing this step.
> >
>
> So this flow is OK for me. Would you please add comment for this
> information so that we could understand how hardware work.
>
> Regards,
> Chun-Kuang.
>

Okay, I'll add it in next patch. Thanks !

> > Thanks !
> >
> > > > >
> > > > > > else
> > > > > > break
> > > > > > }
> > > > > > }
> > > > > >
> > > > > > devapc_violation_irq()
> > > > > > {
> > > > > > for all vio_idx {
> > > > > > mask_module_irq(true)
> > > > > > }
> > > > > >
> > > > > > devapc_dump_vio_dbg()
> > > > > >
> > > > > > for all vio_idx {
> > > > > > clear_vio_status()
> > > > > > mask_module_irq(false)
> > > > > > }
> > > > > > }
> > > > > >
> > > > > > Is it more clear for this control flow?
> > > > > > Thanks !
> > > > > >
> > > > > > > >
> > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > About your question, vio_addr would be the first one.
> > > > > > > > > > >
> > > > > > > > > > > So other vio_addr would be dropped? Or hardware would keep all
> > > > > > > > > > > vio_addr and you have some way to get all vio_addr?
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > In this case, hardware will drop other violation info and keep the first
> > > > > > > > > > one until it been handled.
> > > > > > > > >
> > > > > > > > > Does 'handled' mean status is cleared?
> > > > > > > >
> > > > > > > > "handled" means clear status and dump violation info.
> > > > > > > >
> > > > > > > > >
> > > > > > > > > Regards,
> > > > > > > > > Chun-Kuang.
> > > > > > > > >
> > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + return true;
> > > > > > > > > > > > > > +}
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +/*
> > > > > > > > > > > > > > + * devapc_violation_irq - the devapc Interrupt Service Routine (ISR) will dump
> > > > > > > > > > > > > > + * violation information including which master violates
> > > > > > > > > > > > > > + * access slave.
> > > > > > > > > > > > > > + */
> > > > > > > > > > > > > > +static irqreturn_t devapc_violation_irq(int irq_number,
> > > > > > > > > > > > > > + struct mtk_devapc_context *ctx)
> > > > > > > > > > > > > > +{
> > > > > > > > > > > > > > + u32 vio_idx;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + for (vio_idx = 0; vio_idx < ctx->vio_idx_num; vio_idx++) {
> > > > > > > > > > > > > > + if (!mtk_devapc_dump_vio_dbg(ctx, vio_idx))
> > > > > > > > > > > > > > + continue;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + /* Ensure that violation info are written before
> > > > > > > > > > > > > > + * further operations
> > > > > > > > > > > > > > + */
> > > > > > > > > > > > > > + smp_mb();
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + /*
> > > > > > > > > > > > > > + * Mask slave's irq before clearing vio status.
> > > > > > > > > > > > > > + * Must do it to avoid nested interrupt and prevent
> > > > > > > > > > > > > > + * unexpected behavior.
> > > > > > > > > > > > > > + */
> > > > > > > > > > > > > > + mask_module_irq(ctx, vio_idx, true);
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + clear_vio_status(ctx, vio_idx);
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + mask_module_irq(ctx, vio_idx, false);
> > > > > > > > > > > > > > + }
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > + return IRQ_HANDLED;
> > > > > > > > > > > > > > +}
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +/*
> > > > > > > >
> > > > > >
> > > >
> >