[ 
https://issues.apache.org/jira/browse/TUBEMQ-120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guocheng Zhang updated TUBEMQ-120:
----------------------------------
    Fix Version/s:     (was: 0.5.0)
                   0.6.0

> Aligned disk data storage
> -------------------------
>
>                 Key: TUBEMQ-120
>                 URL: https://issues.apache.org/jira/browse/TUBEMQ-120
>             Project: Apache TubeMQ
>          Issue Type: Sub-task
>            Reporter: Guocheng Zhang
>            Assignee: Guocheng Zhang
>            Priority: Major
>             Fix For: 0.6.0
>
>         Attachments: image-2020-05-15-11-28-20-118.png
>
>
> 1. Data read and write operations should consider the characteristics of the 
> disk, for example, the disk is based on 512-byte sectors as its storage unit, 
> and read data in batches of 64k; the file system will eliminate the cache 
> according to certain rules Pages in memory etc. If the read and write 
> operations take these contents into account, I believe that the current TPS 
> can be higher;
> -----------------------------------
> I understand the reason for this problem, the problem I think is that the 
> data alignment needs to be considered when storing data to disk, so that the 
> head can read the data with as few accesses as possible. For example, the 
> data b in the picture, when it unaligned stored, two sectors need to be 
> accessed, but the head only needs to access and read one sector after aligned 
> storage:
> !image-2020-05-15-11-28-20-118.png!  
>  The performance difference of this problem may not be obvious under a single 
> access, but under the reading of massive messages, the performance will be 
> essentially improved from quantitative to qualitative changes, especially 
> since TubeMQ internally reads data randomly, it is more necessary to consider 
> this .
> After the modification, a new problem introduced based on this modification 
> needs our attention, that is, the storage space used after the system 
> modification will increase under the same data amount. As shown in the 
> picture, because the data is aligned, for small packet data (below the sector 
> size ) a, will occupy more space. However, considering that the overall 
> performance of the disk is unchanged, and the disk space can reach a very 
> high capacity, this waste is considered acceptable to me
> For the optimization of this problem, do you have any suggest ? If not, I am 
> going to claim this modification and optimize it according to this idea.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to