On Tue, 2021-10-26 at 11:05 +0800, Xiubo Li wrote:
On 10/22/21 1:30 AM, Patrick Donnelly wrote:I'd rather not multiplex the output of this file based on some input.
On Thu, Oct 21, 2021 at 12:35 PM Jeff Layton <jlayton@xxxxxxxxxx> wrote:Any suggestion to improve this ?
On Thu, 2021-10-21 at 12:18 -0400, Patrick Donnelly wrote:Okay that makes more sense!
On Thu, Oct 21, 2021 at 11:44 AM Jeff Layton <jlayton@xxxxxxxxxx> wrote:Yes. For instance:
On Thu, 2021-10-21 at 09:52 -0400, Patrick Donnelly wrote:Maybe I'm confused. Is there some "file" which is already used for
On Wed, Oct 20, 2021 at 12:27 PM Jeff Layton <jlayton@xxxxxxxxxx> wrote:Tracking this sort of thing is the main purpose of the stats code. I'm
On Wed, 2021-10-20 at 15:37 +0100, Luís Henriques wrote:I think it's a good idea to integrate this into "stats" but I think a
This counter will keep track of the number of remote object copies done onI think this would be better integrated into the stats infrastructure.
copy_file_range syscalls. This counter will be filesystem per-client, and
can be accessed from the client debugfs directory.
Cc: Patrick Donnelly <pdonnell@xxxxxxxxxx>
Signed-off-by: Luís Henriques <lhenriques@xxxxxxx>
---
This is an RFC to reply to Patrick's request in [0]. Note that I'm not
100% sure about the usefulness of this patch, or if this is the best way
to provide the functionality Patrick requested. Anyway, this is just to
get some feedback, hence the RFC.
Cheers,
--
Luís
[0] https://github.com/ceph/ceph/pull/42720
Maybe you could add a new set of "copy" stats to struct
ceph_client_metric that tracks the total copy operations done, their
size and latency (similar to read and write ops)?
local debugfs file for some counters is still useful. The "stats"
module is immature at this time and I'd rather not build any qa tests
(yet) that rely on it.
Can we generalize this patch-set to a file named "op_counters" or
similar and additionally add other OSD ops performed by the kclient?
really not keen on adding a whole separate set of files for reporting
this.
this type of debugging information? Or do you mean the code for
sending stats to the MDS to support cephfs-top?
What's the specific problem with relying on the data in debugfsMaybe no problem? I wasn't aware of a "metrics" file.
"metrics" file?
# cat /sys/kernel/debug/ceph/*/metrics
item total
------------------------------------------
opened files / total inodes 0 / 4
pinned i_caps / total inodes 5 / 4
opened inodes / total inodes 0 / 4
item total avg_lat(us) min_lat(us) max_lat(us) stdev(us)
-----------------------------------------------------------------------------------
read 0 0 0 0 0
write 5 914013 824797 1092343 103476
metadata 79 12856 1572 114572 13262
item total avg_sz(bytes) min_sz(bytes) max_sz(bytes) total_sz(bytes)
----------------------------------------------------------------------------------------
read 0 0 0 0 0
write 5 4194304 4194304 4194304 20971520
item total miss hit
-------------------------------------------------
d_lease 11 0 29
caps 5 68 10702
I'm proposing that Luis add new lines for "copy" to go along with the
"read" and "write" ones. The "total" counter should give you a count of
the number of operations.
Side note: I am a bit horrified by how computer-unfriendly that
table-formatted data is.
How about just make the "metric" file writable like a switch ? And as
default it will show the data as above and if tools want the
computer-friendly format, just write none-zero to it, then show raw data
just like:
# cat /sys/kernel/debug/ceph/*/metrics
opened_files:0
pinned_i_caps:5
opened_inodes:0
total_inodes:4
read_latency:0,0,0,0,0
write_latency:5,914013,824797,1092343,103476
metadata_latency:79,12856,1572,114572,13262
read_size:0,0,0,0,0
write_size:5,4194304,4194304,4194304,20971520
d_lease:11,0,29
caps:5,68,10702
That would also be rather hard to do -- write() and read() are two
different syscalls, so you'd need to track a bool (or something) across
them somehow.
Currently, I doubt there are many scripts in the field that scrape this
info and debugfs is specifically excluded from ABI concerns. If we want
to make it more machine-readable (which sounds like a good thing), then
I suggest we just change the output to something like what you have
above and not worry about preserving the "legacy" output.