Re: [RFC PATCH v2 00/12] crypto: Adiantum support

From: Ard Biesheuvel
Date: Fri Oct 19 2018 - 23:24:15 EST

Next message: Ard Biesheuvel: "Re: [RFC PATCH v2 06/12] crypto: arm/chacha20 - refactor to allow varying number of rounds"
Previous message: kbuild test robot: "[tip:locking/core 6/10] arch/x86/include/asm/rmwcc.h:23:17: error: jump into statement expression"
In reply to: Paul Crowley: "Re: [RFC PATCH v2 00/12] crypto: Adiantum support"
Next in thread: Eric Biggers: "Re: [RFC PATCH v2 00/12] crypto: Adiantum support"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On 20 October 2018 at 02:19, Paul Crowley <paulcrowley@xxxxxxxxxx> wrote:
> On Fri, 19 Oct 2018 at 08:58, Jason A. Donenfeld <Jason@xxxxxxxxx> wrote:
>> Before merging this into the kernel, do you want to wait until you've
>> received some public review from academia?
>
> I would prefer not to wait. Unlike a new primitive whose strength can
> only be known through attempts at cryptanalysis, Adiantum is a
> construction based on
> well-understood and trusted primitives; it is secure if the proof
> accompanying it is correct. Given that (outside competitions or
> standardization efforts) no-one ever issues public statements that
> they think algorithms or proofs are good, what I'm expecting from
> academia is silence :) The most we could hope for would be getting the
> paper accepted at a conference, and we're pursuing that but there's a
> good chance that won't happen simply because it's not very novel. It
> basically takes existing ideas and applies them using a stream cipher
> instead of a block cipher, and a faster hashing mode; it's also a
> small update from HPolyC. I've had some private feedback that the
> proof seems correct, and that's all I'm expecting to get.

Hi Paul, Eric,

The Adiantum paper claims

"On an ARM Cortex-A7 processor, Adiantum decrypts 4096-byte messages
at 11 cycles per byte, five times faster than AES-256-XTS, with a
constant-time implementation."

which is surprising to me. The bit slicing NEON AES core runs at ~14
cycle per byte on a Cortex-A15 (when encrypting), so 55 cycles per
byte on A7 sounds rather high. Is it really that bad?

Also, the paper mentions that the second hash pass and the stream
cipher en/decryption pass could be executed in parallel, while your
implementation performs three distinct passes. Do you have any
estimates on the potential performance gain of implementing that? In
my experience (which is mostly A53 rather than A7 based, mind you),
removing memory accesses can help tremendously to speed up the
execution on low end cores.

Next message: Ard Biesheuvel: "Re: [RFC PATCH v2 06/12] crypto: arm/chacha20 - refactor to allow varying number of rounds"
Previous message: kbuild test robot: "[tip:locking/core 6/10] arch/x86/include/asm/rmwcc.h:23:17: error: jump into statement expression"
In reply to: Paul Crowley: "Re: [RFC PATCH v2 00/12] crypto: Adiantum support"
Next in thread: Eric Biggers: "Re: [RFC PATCH v2 00/12] crypto: Adiantum support"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]