Re: [PATCH v2 07/15] peci: Add peci-aspeed controller driver
From: Winiarska, Iwona
Date: Sun Aug 29 2021 - 15:42:18 EST
On Fri, 2021-08-27 at 09:24 -0700, Dan Williams wrote:
> On Thu, Aug 26, 2021 at 4:55 PM Winiarska, Iwona
> <iwona.winiarska@xxxxxxxxx> wrote:
> >
> > On Wed, 2021-08-25 at 18:35 -0700, Dan Williams wrote:
> > > On Tue, Aug 3, 2021 at 4:35 AM Iwona Winiarska
> > > <iwona.winiarska@xxxxxxxxx> wrote:
> > > >
> > > > From: Jae Hyun Yoo <jae.hyun.yoo@xxxxxxxxxxxxxxx>
> > > >
> > > > ASPEED AST24xx/AST25xx/AST26xx SoCs supports the PECI electrical
> > > > interface (a.k.a PECI wire).
> > >
> > > Maybe a one sentence blurb here and in the Kconfig reminding people
> > > why they should care if they have a PECI driver or not?
> >
> > Ok, I'll expand it a bit.
> [..]
> > > > +static int aspeed_peci_xfer(struct peci_controller *controller,
> > > > + u8 addr, struct peci_request *req)
> > > > +{
> > > > + struct aspeed_peci *priv = dev_get_drvdata(controller-
> > > > >dev.parent);
> > > > + unsigned long flags, timeout = msecs_to_jiffies(priv-
> > > > > cmd_timeout_ms);
> > > > + u32 peci_head;
> > > > + int ret;
> > > > +
> > > > + if (req->tx.len > ASPEED_PECI_DATA_BUF_SIZE_MAX ||
> > > > + req->rx.len > ASPEED_PECI_DATA_BUF_SIZE_MAX)
> > > > + return -EINVAL;
> > > > +
> > > > + /* Check command sts and bus idle state */
> > > > + ret = aspeed_peci_check_idle(priv);
> > > > + if (ret)
> > > > + return ret; /* -ETIMEDOUT */
> > > > +
> > > > + spin_lock_irqsave(&priv->lock, flags);
> > > > + reinit_completion(&priv->xfer_complete);
> > > > +
> > > > + peci_head = FIELD_PREP(ASPEED_PECI_TARGET_ADDR_MASK, addr) |
> > > > + FIELD_PREP(ASPEED_PECI_WR_LEN_MASK, req->tx.len) |
> > > > + FIELD_PREP(ASPEED_PECI_RD_LEN_MASK, req->rx.len);
> > > > +
> > > > + writel(peci_head, priv->base + ASPEED_PECI_RW_LENGTH);
> > > > +
> > > > + memcpy_toio(priv->base + ASPEED_PECI_WR_DATA0, req->tx.buf,
> > > > min_t(u8, req->tx.len, 16));
> > > > + if (req->tx.len > 16)
> > > > + memcpy_toio(priv->base + ASPEED_PECI_WR_DATA4, req-
> > > > >tx.buf +
> > > > 16,
> > > > + req->tx.len - 16);
> > > > +
> > > > + dev_dbg(priv->dev, "HEAD : 0x%08x\n", peci_head);
> > > > + print_hex_dump_bytes("TX : ", DUMP_PREFIX_NONE, req->tx.buf,
> > > > req-
> > > > > tx.len);
> > >
> > > On CONFIG_DYNAMIC_DEBUG=n builds the kernel will do all the work of
> > > reading through this buffer, but skip emitting it. Are you sure you
> > > want to pay that overhead for every transaction?
> >
> > I can remove it or I can add something like:
> >
> > #if IS_ENABLED(CONFIG_PECI_DEBUG)
> > #define peci_debug(fmt, ...) pr_debug(fmt, ##__VA_ARGS__)
> > #else
> > #define peci_debug(...) do { } while (0)
> > #endif
>
> It's the hex dump I'm worried about, not the debug statements as much.
>
> I think the choices are remove the print_hex_dump_bytes(), put it
> behind an IS_ENABLED(CONFIG_DYNAMIC_DEBUG) to ensure the overhead is
> skipped in the CONFIG_DYNAMIC_DEBUG=n case, or live with the overhead
> if this is not a fast path / infrequently used.
I will place it behind IS_ENABLED(CONFIG_DYNAMIC_DEBUG).
>
> >
> > (and similar peci_trace with trace_printk for usage in IRQ handlers and
> > such).
> >
> > What do you think?
>
> In general, no, don't wrap the base level print routines with
> driver-specific ones. Also, trace_printk() is only for debug builds.
> Note that trace points are built to be even less overhead than
> dev_dbg(), so there's no overhead concern with disabled tracepoints,
> they literally translate to nops when disabled.
Ack.
>
> >
> > >
> > > > +
> > > > + priv->status = 0;
> > > > + writel(ASPEED_PECI_CMD_FIRE, priv->base + ASPEED_PECI_CMD);
> > > > + spin_unlock_irqrestore(&priv->lock, flags);
> > > > +
> > > > + ret = wait_for_completion_interruptible_timeout(&priv-
> > > > > xfer_complete, timeout);
> > >
> > > spin_lock_irqsave() says "I don't know if interrupts are disabled
> > > already, so I'll save the state, whatever it is, and restore later"
> > >
> > > wait_for_completion_interruptible_timeout() says "I know I am in a
> > > sleepable context where interrupts are enabled"
> > >
> > > So, one of those is wrong, i.e. should it be spin_{lock,unlock}_irq()?
> >
> > You're right - I'll fix it.
> >
> > >
> > >
> > > > + if (ret < 0)
> > > > + return ret;
> > > > +
> > > > + if (ret == 0) {
> > > > + dev_dbg(priv->dev, "Timeout waiting for a response!\n");
> > > > + return -ETIMEDOUT;
> > > > + }
> > > > +
> > > > + spin_lock_irqsave(&priv->lock, flags);
> > > > +
> > > > + writel(0, priv->base + ASPEED_PECI_CMD);
> > > > +
> > > > + if (priv->status != ASPEED_PECI_INT_CMD_DONE) {
> > > > + spin_unlock_irqrestore(&priv->lock, flags);
> > > > + dev_dbg(priv->dev, "No valid response!\n");
> > > > + return -EIO;
> > > > + }
> > > > +
> > > > + spin_unlock_irqrestore(&priv->lock, flags);
> > > > +
> > > > + memcpy_fromio(req->rx.buf, priv->base + ASPEED_PECI_RD_DATA0,
> > > > min_t(u8, req->rx.len, 16));
> > > > + if (req->rx.len > 16)
> > > > + memcpy_fromio(req->rx.buf + 16, priv->base +
> > > > ASPEED_PECI_RD_DATA4,
> > > > + req->rx.len - 16);
> > > > +
> > > > + print_hex_dump_bytes("RX : ", DUMP_PREFIX_NONE, req->rx.buf,
> > > > req-
> > > > > rx.len);
> > > > +
> > > > + return 0;
> > > > +}
> > > > +
> > > > +static irqreturn_t aspeed_peci_irq_handler(int irq, void *arg)
> > > > +{
> > > > + struct aspeed_peci *priv = arg;
> > > > + u32 status;
> > > > +
> > > > + spin_lock(&priv->lock);
> > > > + status = readl(priv->base + ASPEED_PECI_INT_STS);
> > > > + writel(status, priv->base + ASPEED_PECI_INT_STS);
> > > > + priv->status |= (status & ASPEED_PECI_INT_MASK);
> > > > +
> > > > + /*
> > > > + * In most cases, interrupt bits will be set one by one but also
> > > > note
> > > > + * that multiple interrupt bits could be set at the same time.
> > > > + */
> > > > + if (status & ASPEED_PECI_INT_BUS_TIMEOUT)
> > > > + dev_dbg_ratelimited(priv->dev,
> > > > "ASPEED_PECI_INT_BUS_TIMEOUT\n");
> > > > +
> > > > + if (status & ASPEED_PECI_INT_BUS_CONTENTION)
> > > > + dev_dbg_ratelimited(priv->dev,
> > > > "ASPEED_PECI_INT_BUS_CONTENTION\n");
> > > > +
> > > > + if (status & ASPEED_PECI_INT_WR_FCS_BAD)
> > > > + dev_dbg_ratelimited(priv->dev,
> > > > "ASPEED_PECI_INT_WR_FCS_BAD\n");
> > > > +
> > > > + if (status & ASPEED_PECI_INT_WR_FCS_ABORT)
> > > > + dev_dbg_ratelimited(priv->dev,
> > > > "ASPEED_PECI_INT_WR_FCS_ABORT\n");
> > >
> > > Are you sure these would not be better as tracepoints? If you're
> > > debugging an interrupt related failure, the ratelimiting might get in
> > > your way when you really need to know when one of these error
> > > interrupts fire relative to another event.
> >
> > Tracepoints are ABI(ish), and using a full blown tracepoint just for IRQ
> > status
> > would probably be too much.
>
> Tracepoints become ABI once someone ships tooling that depends on them
> being there. These don't look attractive for a tool, and they don't
> look difficult to maintain if the interrupt handler needs to be
> reworked. I.e. it would be trivial to keep a dead tracepoint around if
> worse came to worse to keep a tool from failing to load.
After more consideration, I would prefer to remove these logs for now - in case
of error I'll log full status in xfer().
>
> > I was thinking about something like trace_printk hidden under a
> > "CONFIG_PECI_DEBUG" (see above), but perhaps that's something for the future
> > improvement?
>
> Again trace_printk() is only for private builds.
>
> >
> > >
> > > > +
> > > > + /*
> > > > + * All commands should be ended up with a
> > > > ASPEED_PECI_INT_CMD_DONE
> > > > bit
> > > > + * set even in an error case.
> > > > + */
> > > > + if (status & ASPEED_PECI_INT_CMD_DONE)
> > > > + complete(&priv->xfer_complete);
> > >
> > > Hmm, no need to check if there was a sequencing error, like a command
> > > was never submitted?
> >
> > It's handled by checking if HW is idle in xfer before a command is sent,
> > where
> > we just expect a single interrupt per command.
>
> I'm asking how do you determine if this status was spurious, or there
> was a sequencing error in the driver?
I don't think we have any means to determine it.
PECI itself doesn't provide any mechanism to verify it (there is no sequence
number or tag to match request/response).
We're relying on the fact that BMC is a requester and initiates communication
with CPU - the interrupt won't be generated if BMC doesn't send any request.
Thanks
-Iwona