Re: [RFC PATCH v2] ceph: add remote object copy counter to fs client metrics
From: Jeff Layton
Date: Tue Oct 26 2021 - 08:22:47 EST
On Mon, 2021-10-25 at 16:00 +0100, Luís Henriques wrote:
> This counter keeps track of the number of remote object copies done by
> copy_file_range syscalls. It is kept using the metrics infrastructure and
> is thus accessible through debugfs. For now, this counter won't be sent
> to the MDS.
>
> Cc: Patrick Donnelly <pdonnell@xxxxxxxxxx>
> Signed-off-by: Luís Henriques <lhenriques@xxxxxxx>
> ---
> Hi!
>
> So, here's v2 of this RFC. Now, I guess that Patrick's idea of adding
> this counter was to validate the test results, isn't that right? If so,
> this has to be done from within the fstests code and not from a teuthology
> test. The reason is that fstests mounts and unmounts the filesystems under
> test, which effectively wipes the metrics on the client.
>
> So, the follow-up to this patch would be changes to the corresponding
> fstests so that they would access this debugfs file and check the counter
> is set to the expected value.
>
> Cheers,
> --
> Luís
>
> fs/ceph/debugfs.c | 6 ++++++
> fs/ceph/file.c | 1 +
> fs/ceph/metric.c | 2 ++
> fs/ceph/metric.h | 2 ++
> 4 files changed, 11 insertions(+)
>
> diff --git a/fs/ceph/debugfs.c b/fs/ceph/debugfs.c
> index 38b78b45811f..9f1a09816541 100644
> --- a/fs/ceph/debugfs.c
> +++ b/fs/ceph/debugfs.c
> @@ -235,6 +235,12 @@ static int metric_show(struct seq_file *s, void *p)
> percpu_counter_sum(&m->i_caps_mis),
> percpu_counter_sum(&m->i_caps_hit));
>
> + seq_printf(s, "\n");
> + seq_printf(s, "item total\n");
> + seq_printf(s, "-------------------\n");
> + seq_printf(s, "%-14s%-16lld\n", "copy-from",
> + atomic64_read(&m->total_copyfrom));
> +
> return 0;
> }
>
> diff --git a/fs/ceph/file.c b/fs/ceph/file.c
> index e61018d9764e..b36a7b9c1ab8 100644
> --- a/fs/ceph/file.c
> +++ b/fs/ceph/file.c
> @@ -2253,6 +2253,7 @@ static ssize_t ceph_do_objects_copy(struct ceph_inode_info *src_ci, u64 *src_off
> bytes = ret;
> goto out;
> }
> + atomic64_inc(&fsc->mdsc->metric.total_copyfrom);
> len -= object_size;
> bytes += object_size;
> *src_off += object_size;
> diff --git a/fs/ceph/metric.c b/fs/ceph/metric.c
> index 04d5df29bbbf..a8a9f96c56a8 100644
> --- a/fs/ceph/metric.c
> +++ b/fs/ceph/metric.c
> @@ -278,6 +278,8 @@ int ceph_metric_init(struct ceph_client_metric *m)
> if (ret)
> goto err_total_inodes;
>
> + atomic64_set(&m->total_copyfrom, 0);
> +
> m->session = NULL;
> INIT_DELAYED_WORK(&m->delayed_work, metric_delayed_work);
>
> diff --git a/fs/ceph/metric.h b/fs/ceph/metric.h
> index 0133955a3c6a..a1e2cd46de6b 100644
> --- a/fs/ceph/metric.h
> +++ b/fs/ceph/metric.h
> @@ -169,6 +169,8 @@ struct ceph_client_metric {
> struct percpu_counter opened_inodes;
> struct percpu_counter total_inodes;
>
> + atomic64_t total_copyfrom;
> +
> struct ceph_mds_session *session;
> struct delayed_work delayed_work; /* delayed work */
> };
I know the main interest currently is just the count of ops, but I do
think that we'll want a full set of stats like we track for
reads/writes, and I'd rather not rev the file format any more than we
need to.

Could you extend struct ceph_client_metric with a full set of copy stats
and plumb in the places to record and report them? It should be pretty
similar to how reads/writes are tracked.
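
As a rough illustration of what a "full set" of copy stats could record, here
is a user-space sketch, not kernel code. The struct and function names
(copy_stats, copy_stats_record, and so on) are hypothetical, and the choice of
fields (count, size sum/min/max, latency sum/min/max) is an assumption about
what mirroring the read/write metrics would involve. Locking is left out; the
kernel version would need to serialize updates.

```c
#include <assert.h>
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical stand-in for the copy-related fields of a client metric. */
struct copy_stats {
	uint64_t total;       /* number of remote object copies */
	uint64_t size_sum;    /* total bytes copied */
	uint64_t size_min;    /* smallest single copy, in bytes */
	uint64_t size_max;    /* largest single copy, in bytes */
	uint64_t lat_sum_ns;  /* summed latency, in nanoseconds */
	uint64_t lat_min_ns;  /* fastest copy */
	uint64_t lat_max_ns;  /* slowest copy */
};

void copy_stats_init(struct copy_stats *s)
{
	assert(s != NULL);
	s->total = 0;
	s->size_sum = 0;
	s->size_min = UINT64_MAX;   /* so the first sample always lowers it */
	s->size_max = 0;
	s->lat_sum_ns = 0;
	s->lat_min_ns = UINT64_MAX;
	s->lat_max_ns = 0;
}

/* Record one completed copy. Not thread-safe in this sketch. */
void copy_stats_record(struct copy_stats *s, uint64_t size, uint64_t lat_ns)
{
	s->total++;
	s->size_sum += size;
	if (size < s->size_min)
		s->size_min = size;
	if (size > s->size_max)
		s->size_max = size;
	s->lat_sum_ns += lat_ns;
	if (lat_ns < s->lat_min_ns)
		s->lat_min_ns = lat_ns;
	if (lat_ns > s->lat_max_ns)
		s->lat_max_ns = lat_ns;
}

/* Average copy size in bytes, or 0 if nothing has been recorded. */
uint64_t copy_stats_avg_size(const struct copy_stats *s)
{
	return s->total ? s->size_sum / s->total : 0;
}

/* Print one row, reporting 0 for min values when there are no samples. */
void copy_stats_show(const struct copy_stats *s, FILE *out)
{
	uint64_t size_min = s->total ? s->size_min : 0;
	uint64_t lat_min = s->total ? s->lat_min_ns : 0;
	uint64_t lat_avg = s->total ? s->lat_sum_ns / s->total : 0;

	fprintf(out, "copy total=%" PRIu64 " avg_sz=%" PRIu64
		" min_sz=%" PRIu64 " max_sz=%" PRIu64
		" avg_lat_ns=%" PRIu64 " min_lat_ns=%" PRIu64
		" max_lat_ns=%" PRIu64 "\n",
		s->total, copy_stats_avg_size(s), size_min, s->size_max,
		lat_avg, lat_min, s->lat_max_ns);
}
```

In this model, copy_stats_record() would be called once per successful object
copy, at the point where the patch currently does the atomic64_inc(), and
copy_stats_show() plays the role of the extra rows in metric_show().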
--
Jeff Layton <jlayton@xxxxxxxxxx>