Daniel Phillips <phillips@arcor.de> wrote:
>
> Yes, I think that's exactly what's happening. There are some questions
> remaining, such as why doesn't it happen to akpm.
Oh but it does. The 30,000 mkdirs test takes 76 seconds on 1k blocks, 16
seconds on 4k blocks.
With 1k blocks, the journal is 1/4 the size, and there are 4x as many
blockgroups.
So not only does the fs have to checkpoint 4x as often, it has to seek all
over the disk to do it.
If you create a 1k blocksize fs with a 100MB journal, the test takes just
nine seconds.
But note that after these nine seconds, we were left with 50MB of dirty
buffercache. When pdflush later comes along to write that out it takes >30
seconds, because of the additional fragmentation from the tiny blockgroups.
So all we've done is to pipeline the pain.
I suspect the Orlov allocator isn't doing the right thing here - a full
dumpe2fs of the disk shows that the directories are all over the place.
Nodoby has really looked at fine-tuning Orlov.
btw, an `rm -rf' of the dir which holds 30,000 dirs takes 50 seconds system
time on a 2.7GHz CPU. What's up with that?
c01945bc str2hashbuf 9790 61.1875
c0187064 ext3_htree_store_dirent 10472 28.7692
c0154920 __getblk 11319 202.1250
c018d11c dx_probe 15934 24.2896
c018d4d0 ext3_htree_fill_tree 15984 33.8644
c0188d3c ext3_get_branch 18238 81.4196
c0136388 find_get_page 18617 202.3587
c015477c bh_lru_install 20493 94.8750
c0194290 TEA_transform 21463 173.0887
c01536b0 __find_get_block_slow 24069 62.6797
c0154620 __brelse 36139 752.8958
c01893dc ext3_get_block_handle 43815 49.7898
c0154854 __find_get_block 71292 349.4706
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
This archive was generated by hypermail 2b29 : Sat Mar 15 2003 - 22:00:16 EST