[jira] [Assigned] (HBASE-28463) Time Based Priority for BucketCache

2024-03-27 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-28463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar reassigned HBASE-28463:
-

Assignee: Rahul Agarkar

> Time Based Priority for BucketCache
> ---
>
> Key: HBASE-28463
> URL: https://issues.apache.org/jira/browse/HBASE-28463
> Project: HBase
>  Issue Type: New Feature
>  Components: BucketCache
>Reporter: Janardhan Hungund
>Assignee: Rahul Agarkar
>Priority: Major
>
> This Jira introduces time-based data tiering in HBase to optimize storage
> efficiency and access performance by segregating data based on its recency.
> By keeping recent data in the bucket cache (backed by faster storage such as
> SSDs) and evicting older data, the feature gives more flexible, configurable
> control over cache allocation and eviction, allowing time priorities to be
> defined for cached data.
> The need for a richer cache allocation mechanism is even more critical on
> HBase deployments where cache hits yield significant performance gains, such
> as when cloud storage is the underlying file system.
> Data is classified as hot or cold based on its age. Recent data within a
> configured time range (hot-data-age) is treated as hot and is stored in the
> ephemeral cache, while older data is stored in and read from cloud storage.
> This feature aims to reduce TCO by optimizing the use of the high-cost
> bucket cache. It is a good fit for use cases with date-based writes where
> scans focus on recently written data.
> Please find the detailed design document for the feature attached to the
> Jira.
> Thanks,
> Janardhan
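
The hot/cold split described above can be illustrated with a minimal sketch. This is not the HBase implementation: the 7-day window, the function names, and the file names are assumptions made only for this example, and the real feature is driven by HBase configuration.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical window, standing in for the "hot-data-age" setting named above.
HOT_DATA_AGE = timedelta(days=7)


def is_hot(write_time, now, hot_data_age=HOT_DATA_AGE):
    """Return True when the data was written within the hot-data-age window."""
    return now - write_time <= hot_data_age


def select_for_cache(files, now, hot_data_age=HOT_DATA_AGE):
    """Keep only the names of files whose write time is still hot."""
    return [name for name, write_time in files
            if is_hot(write_time, now, hot_data_age)]


now = datetime(2024, 3, 27, tzinfo=timezone.utc)
files = [
    ("recent-file", now - timedelta(days=1)),   # inside the window: hot
    ("old-file", now - timedelta(days=30)),     # outside the window: cold
]
print(select_for_cache(files, now))
```

Only the files returned by `select_for_cache` would be candidates for the bucket cache in this sketch; everything else would be read from the underlying storage.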



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (HBASE-28186) Rebase CacheAwareBalance related commits into master branch

2023-11-27 Thread Rahul Agarkar (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-28186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17790199#comment-17790199
 ] 

Rahul Agarkar commented on HBASE-28186:
---

Hi [~wchevreuil], I have raised PR #5541 to merge these changes into
branch-2.

> Rebase CacheAwareBalance related commits into master branch
> ---
>
> Key: HBASE-28186
> URL: https://issues.apache.org/jira/browse/HBASE-28186
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Wellington Chevreuil
>Assignee: Wellington Chevreuil
>Priority: Major
>






[jira] [Updated] (HBASE-28097) Add documentation section for the Cache Aware balancer function

2023-11-02 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-28097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-28097:
--
Status: Patch Available  (was: In Progress)

> Add documentation section for the Cache Aware balancer function
> ---
>
> Key: HBASE-28097
> URL: https://issues.apache.org/jira/browse/HBASE-28097
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Wellington Chevreuil
>Assignee: Rahul Agarkar
>Priority: Major
>






[jira] [Work started] (HBASE-28097) Add documentation section for the Cache Aware balancer function

2023-11-02 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-28097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HBASE-28097 started by Rahul Agarkar.
-
> Add documentation section for the Cache Aware balancer function
> ---
>
> Key: HBASE-28097
> URL: https://issues.apache.org/jira/browse/HBASE-28097
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Wellington Chevreuil
>Assignee: Rahul Agarkar
>Priority: Major
>






[jira] [Updated] (HBASE-27999) Implement cache aware load balancer

2023-08-23 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27999:
--
Attachment: (was: HBASE-27999 Cache Aware Load Balancer.docx)

> Implement cache aware load balancer
> ---
>
> Key: HBASE-27999
> URL: https://issues.apache.org/jira/browse/HBASE-27999
> Project: HBase
>  Issue Type: Sub-task
>  Components: Balancer
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>
> HBase uses an ephemeral cache: blocks are read from slow storage and stored
> in the bucket cache. This cache is warmed up every time a region server is
> started. Depending on the data size and the configured cache size, the
> warm-up can take anywhere from a few minutes to a few hours, so repeating it
> on every region server start is very expensive. To avoid this, HBASE-27313
> implemented the cache persistence feature, in which the region servers
> periodically persist the blocks cached in the bucket cache. This persisted
> information is then used to restore the cache when a region server restarts,
> whether after a normal restart or a crash.
> This feature builds on that capability so that the balancer considers the
> cache allocation of each region on the region servers when calculating a new
> assignment plan. Using the region/region server cache allocation info
> reported by the region servers, the balancer calculates the percentage of
> HFiles cached for each region on the hosting server and uses that as another
> factor when deciding on an optimal new assignment plan.
>  
> A design document describing the balancer can be found at 
> https://docs.google.com/document/d/1A8-eVeRhZjwL0hzFw9wmXl8cGP4BFomSlohX2QcaFg4/edit?usp=sharing
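
As a rough illustration of the "percentage of HFiles cached for each region" idea in the description above, here is a small sketch. It assumes the ratio is measured by file size and that each region server reports the set of HFile names it has cached; both are assumptions for this example, and all names and numbers are hypothetical rather than taken from the HBase code.

```python
def cached_ratio(hfile_sizes, cached_hfiles):
    """Fraction of a region's HFile bytes that are cached on one server."""
    total = sum(hfile_sizes.values())
    if total == 0:
        return 0.0
    cached = sum(size for name, size in hfile_sizes.items()
                 if name in cached_hfiles)
    return cached / total


def best_cached_server(hfile_sizes, cache_reports):
    """Pick the server whose reported cache covers most of the region."""
    return max(cache_reports,
               key=lambda server: cached_ratio(hfile_sizes, cache_reports[server]))


# Hypothetical region with three HFiles (sizes in bytes) and two servers.
region_hfiles = {"hfile-a": 600, "hfile-b": 300, "hfile-c": 100}
cache_reports = {"rs1": {"hfile-a", "hfile-b"}, "rs2": {"hfile-c"}}

for server, cached in cache_reports.items():
    print(server, cached_ratio(region_hfiles, cached))
print("prefer:", best_cached_server(region_hfiles, cache_reports))
```

A balancer could feed such a ratio into its cost calculation as one factor among several; how the actual balancer weighs it is described in the linked design document, not here.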





[jira] [Updated] (HBASE-27999) Implement cache aware load balancer

2023-08-23 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27999:
--
Description: 
HBase uses an ephemeral cache: blocks are read from slow storage and stored in
the bucket cache. This cache is warmed up every time a region server is
started. Depending on the data size and the configured cache size, the warm-up
can take anywhere from a few minutes to a few hours, so repeating it on every
region server start is very expensive. To avoid this, HBASE-27313 implemented
the cache persistence feature, in which the region servers periodically persist
the blocks cached in the bucket cache. This persisted information is then used
to restore the cache when a region server restarts, whether after a normal
restart or a crash.

This feature builds on that capability so that the balancer considers the cache
allocation of each region on the region servers when calculating a new
assignment plan. Using the region/region server cache allocation info reported
by the region servers, the balancer calculates the percentage of HFiles cached
for each region on the hosting server and uses that as another factor when
deciding on an optimal new assignment plan.

 

A design document describing the balancer can be found at 
https://docs.google.com/document/d/1A8-eVeRhZjwL0hzFw9wmXl8cGP4BFomSlohX2QcaFg4/edit?usp=sharing

  was:
HBase uses an ephemeral cache: blocks are read from slow storage and stored in
the bucket cache. This cache is warmed up every time a region server is
started. Depending on the data size and the configured cache size, the warm-up
can take anywhere from a few minutes to a few hours, so repeating it on every
region server start is very expensive. To avoid this, HBASE-27313 implemented
the cache persistence feature, in which the region servers periodically persist
the blocks cached in the bucket cache. This persisted information is then used
to restore the cache when a region server restarts, whether after a normal
restart or a crash.

This feature builds on that capability so that the balancer considers the cache
allocation of each region on the region servers when calculating a new
assignment plan. Using the region/region server cache allocation info reported
by the region servers, the balancer calculates the percentage of HFiles cached
for each region on the hosting server and uses that as another factor when
deciding on an optimal new assignment plan.


> Implement cache aware load balancer
> ---
>
> Key: HBASE-27999
> URL: https://issues.apache.org/jira/browse/HBASE-27999
> Project: HBase
>  Issue Type: Sub-task
>  Components: Balancer
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>
> HBase uses an ephemeral cache: blocks are read from slow storage and stored
> in the bucket cache. This cache is warmed up every time a region server is
> started. Depending on the data size and the configured cache size, the
> warm-up can take anywhere from a few minutes to a few hours, so repeating it
> on every region server start is very expensive. To avoid this, HBASE-27313
> implemented the cache persistence feature, in which the region servers
> periodically persist the blocks cached in the bucket cache. This persisted
> information is then used to restore the cache when a region server restarts,
> whether after a normal restart or a crash.
> This feature builds on that capability so that the balancer considers the
> cache allocation of each region on the region servers when calculating a new
> assignment plan. Using the region/region server cache allocation info
> reported by the region servers, the balancer calculates the percentage of
> HFiles cached for each region on the hosting server and uses that as another
> factor when deciding on an optimal new assignment plan.
>  
> A design document describing the balancer can be found at 
> https://docs.google.com/document/d/1A8-eVeRhZjwL0hzFw9wmXl8cGP4BFomSlohX2QcaFg4/edit?usp=sharing





[jira] (HBASE-27999) Implement cache aware load balancer

2023-08-23 Thread Rahul Agarkar (Jira)


[ https://issues.apache.org/jira/browse/HBASE-27999 ]


Rahul Agarkar deleted comment on HBASE-27999:
---

was (Author: ragarkar):
A design document for this balancer can be found at 
https://docs.google.com/document/d/1A8-eVeRhZjwL0hzFw9wmXl8cGP4BFomSlohX2QcaFg4/edit?usp=sharing

> Implement cache aware load balancer
> ---
>
> Key: HBASE-27999
> URL: https://issues.apache.org/jira/browse/HBASE-27999
> Project: HBase
>  Issue Type: Sub-task
>  Components: Balancer
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>
> HBase uses an ephemeral cache: blocks are read from slow storage and stored
> in the bucket cache. This cache is warmed up every time a region server is
> started. Depending on the data size and the configured cache size, the
> warm-up can take anywhere from a few minutes to a few hours, so repeating it
> on every region server start is very expensive. To avoid this, HBASE-27313
> implemented the cache persistence feature, in which the region servers
> periodically persist the blocks cached in the bucket cache. This persisted
> information is then used to restore the cache when a region server restarts,
> whether after a normal restart or a crash.
> This feature builds on that capability so that the balancer considers the
> cache allocation of each region on the region servers when calculating a new
> assignment plan. Using the region/region server cache allocation info
> reported by the region servers, the balancer calculates the percentage of
> HFiles cached for each region on the hosting server and uses that as another
> factor when deciding on an optimal new assignment plan.





[jira] [Commented] (HBASE-27999) Implement cache aware load balancer

2023-08-23 Thread Rahul Agarkar (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-27999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17758364#comment-17758364
 ] 

Rahul Agarkar commented on HBASE-27999:
---

A design document for this balancer can be found at 
https://docs.google.com/document/d/1A8-eVeRhZjwL0hzFw9wmXl8cGP4BFomSlohX2QcaFg4/edit?usp=sharing

> Implement cache aware load balancer
> ---
>
> Key: HBASE-27999
> URL: https://issues.apache.org/jira/browse/HBASE-27999
> Project: HBase
>  Issue Type: Sub-task
>  Components: Balancer
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
> Attachments: HBASE-27999 Cache Aware Load Balancer.docx
>
>
> HBase uses an ephemeral cache: blocks are read from slow storage and stored
> in the bucket cache. This cache is warmed up every time a region server is
> started. Depending on the data size and the configured cache size, the
> warm-up can take anywhere from a few minutes to a few hours, so repeating it
> on every region server start is very expensive. To avoid this, HBASE-27313
> implemented the cache persistence feature, in which the region servers
> periodically persist the blocks cached in the bucket cache. This persisted
> information is then used to restore the cache when a region server restarts,
> whether after a normal restart or a crash.
> This feature builds on that capability so that the balancer considers the
> cache allocation of each region on the region servers when calculating a new
> assignment plan. Using the region/region server cache allocation info
> reported by the region servers, the balancer calculates the percentage of
> HFiles cached for each region on the hosting server and uses that as another
> factor when deciding on an optimal new assignment plan.





[jira] [Updated] (HBASE-27389) Add cost function in balancer to consider the cost of building bucket cache before moving regions

2023-08-17 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27389:
--
Description: 
HBase currently uses StochasticLoadBalancer to determine the cost of moving
regions from one RS to another. Each cost function gives a result between 0 and
1, with 0 being the lowest cost and 1 being the highest cost. The balancer
iterates through each cost function and comes up with the total cost. The
balancer then generates multiple candidate balancing plans from random actions
and computes the cost of each plan as if it were executed. If the cost of a
plan is lower than the initial cost, the plan is executed.

Implement a new "CacheAwareCostFunction" which takes into account whether the
region is fully cached and returns the highest cost if the plan suggests moving
this region.

  was:
HBase currently uses StochasticLoadBalancer to determine the cost of moving
regions from one RS to another. Each cost function gives a result between 0 and
1, with 0 being the lowest cost and 1 being the highest cost. The balancer
iterates through each cost function and comes up with the total cost. The
balancer then generates multiple candidate balancing plans from random actions
and computes the cost of each plan as if it were executed. If the cost of a
plan is lower than the initial cost, the plan is executed.

Implement a new "PrefetchCacheCostFunction" which takes into account whether
the region is fully cached and returns the highest cost if the plan suggests
moving this region.


> Add cost function in balancer to consider the cost of building bucket cache 
> before moving regions
> -
>
> Key: HBASE-27389
> URL: https://issues.apache.org/jira/browse/HBASE-27389
> Project: HBase
>  Issue Type: Task
>  Components: Balancer
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>
> HBase currently uses StochasticLoadBalancer to determine the cost of moving
> regions from one RS to another. Each cost function gives a result between 0
> and 1, with 0 being the lowest cost and 1 being the highest cost. The
> balancer iterates through each cost function and comes up with the total
> cost. The balancer then generates multiple candidate balancing plans from
> random actions and computes the cost of each plan as if it were executed. If
> the cost of a plan is lower than the initial cost, the plan is executed.
> Implement a new "CacheAwareCostFunction" which takes into account whether
> the region is fully cached and returns the highest cost if the plan suggests
> moving this region.
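
The cost-based plan selection described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the StochasticLoadBalancer code: the skew cost, the equal weights, the weighted-average combination, and the region and server names are all invented for the example. Only the idea of a cache-aware cost that returns the highest value when a fully cached region is moved comes from the description.

```python
from collections import Counter


def skew_cost(assignment, servers):
    """Spread between the busiest and idlest server, scaled into [0, 1]."""
    counts = Counter(assignment.values())
    loads = [counts.get(server, 0) for server in servers]
    return (max(loads) - min(loads)) / max(len(assignment), 1)


def cache_aware_cost(initial, assignment, fully_cached):
    """Highest cost (1.0) if any fully cached region has left its server."""
    moved = [region for region in assignment
             if assignment[region] != initial[region]]
    return 1.0 if any(region in fully_cached for region in moved) else 0.0


def total_cost(initial, assignment, servers, fully_cached, weights=(1.0, 1.0)):
    """Weighted average of the individual costs (the weights are assumed)."""
    costs = (skew_cost(assignment, servers),
             cache_aware_cost(initial, assignment, fully_cached))
    return sum(w * c for w, c in zip(weights, costs)) / sum(weights)


servers = ["rs1", "rs2"]
initial = {"r1": "rs1", "r2": "rs1", "r3": "rs1", "r4": "rs2"}
fully_cached = {"r1"}

move_cold = dict(initial, r2="rs2")  # moves a region that is not fully cached
move_hot = dict(initial, r1="rs2")   # moves the fully cached region

base = total_cost(initial, initial, servers, fully_cached)
for name, plan in (("move_cold", move_cold), ("move_hot", move_hot)):
    cost = total_cost(initial, plan, servers, fully_cached)
    print(name, cost, "accept" if cost < base else "reject")
```

Both candidate plans even out the region counts, but only the one that leaves the fully cached region in place ends up cheaper than the starting assignment.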





[jira] [Updated] (HBASE-27999) Implement cache aware load balancer

2023-08-17 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27999:
--
Attachment: HBASE-27999 Cache Aware Load Balancer.docx

> Implement cache aware load balancer
> ---
>
> Key: HBASE-27999
> URL: https://issues.apache.org/jira/browse/HBASE-27999
> Project: HBase
>  Issue Type: Sub-task
>  Components: Balancer
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
> Attachments: HBASE-27999 Cache Aware Load Balancer.docx
>
>
> HBase uses an ephemeral cache: blocks are read from slow storage and stored
> in the bucket cache. This cache is warmed up every time a region server is
> started. Depending on the data size and the configured cache size, the
> warm-up can take anywhere from a few minutes to a few hours, so repeating it
> on every region server start is very expensive. To avoid this, HBASE-27313
> implemented the cache persistence feature, in which the region servers
> periodically persist the blocks cached in the bucket cache. This persisted
> information is then used to restore the cache when a region server restarts,
> whether after a normal restart or a crash.
> This feature builds on that capability so that the balancer considers the
> cache allocation of each region on the region servers when calculating a new
> assignment plan. Using the region/region server cache allocation info
> reported by the region servers, the balancer calculates the percentage of
> HFiles cached for each region on the hosting server and uses that as another
> factor when deciding on an optimal new assignment plan.





[jira] [Updated] (HBASE-27999) Implement cache aware load balancer

2023-08-16 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27999:
--
Description: 
HBase uses an ephemeral cache: blocks are read from slow storage and stored in
the bucket cache. This cache is warmed up every time a region server is
started. Depending on the data size and the configured cache size, the warm-up
can take anywhere from a few minutes to a few hours, so repeating it on every
region server start is very expensive. To avoid this, HBASE-27313 implemented
the cache persistence feature, in which the region servers periodically persist
the blocks cached in the bucket cache. This persisted information is then used
to restore the cache when a region server restarts, whether after a normal
restart or a crash.

This feature builds on that capability so that the balancer considers the cache
allocation of each region on the region servers when calculating a new
assignment plan. Using the region/region server cache allocation info reported
by the region servers, the balancer calculates the percentage of HFiles cached
for each region on the hosting server and uses that as another factor when
deciding on an optimal new assignment plan.

> Implement cache aware load balancer
> ---
>
> Key: HBASE-27999
> URL: https://issues.apache.org/jira/browse/HBASE-27999
> Project: HBase
>  Issue Type: Sub-task
>  Components: Balancer
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>
> HBase uses an ephemeral cache: blocks are read from slow storage and stored
> in the bucket cache. This cache is warmed up every time a region server is
> started. Depending on the data size and the configured cache size, the
> warm-up can take anywhere from a few minutes to a few hours, so repeating it
> on every region server start is very expensive. To avoid this, HBASE-27313
> implemented the cache persistence feature, in which the region servers
> periodically persist the blocks cached in the bucket cache. This persisted
> information is then used to restore the cache when a region server restarts,
> whether after a normal restart or a crash.
> This feature builds on that capability so that the balancer considers the
> cache allocation of each region on the region servers when calculating a new
> assignment plan. Using the region/region server cache allocation info
> reported by the region servers, the balancer calculates the percentage of
> HFiles cached for each region on the hosting server and uses that as another
> factor when deciding on an optimal new assignment plan.





[jira] [Updated] (HBASE-27999) Implement cache aware load balancer

2023-08-16 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27999:
--
Summary: Implement cache aware load balancer  (was: Implement cache 
prefetch aware load balancer)

> Implement cache aware load balancer
> ---
>
> Key: HBASE-27999
> URL: https://issues.apache.org/jira/browse/HBASE-27999
> Project: HBase
>  Issue Type: Sub-task
>  Components: Balancer
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>






[jira] [Updated] (HBASE-27997) Enhance prefetch executor to record region prefetch information along with the list of hfiles prefetched

2023-08-03 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27997:
--
Description: 
HBASE-27313 implemented the prefetch persistence feature, which persists the
list of HFiles prefetched in the bucket cache. This information is used to
reconstruct the cache in the event of a server restart/crash.

Currently, only the list of HFiles is persisted.

However, for the new PrefetchAwareLoadBalancer (work in progress) to work, we
need to know how much of a region is prefetched on a region server.

This Jira introduces an additional map in the prefetch executor to track how
much of a region has been prefetched on that region server. The prefetched size
of a region is calculated as the total size of all HFiles prefetched for that
region.

> Enhance prefetch executor to record region prefetch information along with 
> the list of hfiles prefetched
> 
>
> Key: HBASE-27997
> URL: https://issues.apache.org/jira/browse/HBASE-27997
> Project: HBase
>  Issue Type: Sub-task
>  Components: BucketCache
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>
> HBASE-27313 implemented the prefetch persistence feature, which persists the
> list of HFiles prefetched in the bucket cache. This information is used to
> reconstruct the cache in the event of a server restart/crash.
> Currently, only the list of HFiles is persisted.
> However, for the new PrefetchAwareLoadBalancer (work in progress) to work, we
> need to know how much of a region is prefetched on a region server.
> This Jira introduces an additional map in the prefetch executor to track how
> much of a region has been prefetched on that region server. The prefetched
> size of a region is calculated as the total size of all HFiles prefetched
> for that region.
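
The bookkeeping described above, a per-region map kept alongside the list of prefetched HFiles, might look roughly like this. The class and method names are hypothetical and chosen only for illustration; they do not mirror the prefetch executor's actual API.

```python
from collections import defaultdict


class PrefetchTracker:
    """Tracks prefetched HFiles and the prefetched bytes per region."""

    def __init__(self):
        self.prefetched_hfiles = set()                   # what is persisted today
        self.region_prefetched_bytes = defaultdict(int)  # the additional map

    def record(self, region, hfile, size_bytes):
        """Register a prefetched HFile once and add its size to its region."""
        if hfile in self.prefetched_hfiles:
            return  # already counted, avoid double-counting the size
        self.prefetched_hfiles.add(hfile)
        self.region_prefetched_bytes[region] += size_bytes


tracker = PrefetchTracker()
tracker.record("region-a", "hfile-1", 100)
tracker.record("region-a", "hfile-2", 50)
tracker.record("region-a", "hfile-1", 100)  # duplicate report, ignored
tracker.record("region-b", "hfile-3", 70)
print(dict(tracker.region_prefetched_bytes))
```

The per-region totals are what a cache-aware balancer would need in order to judge how much of a region is already warm on a given server.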





[jira] [Work started] (HBASE-27997) Enhance prefetch executor to record region prefetch information along with the list of hfiles prefetched

2023-07-28 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HBASE-27997 started by Rahul Agarkar.
-
> Enhance prefetch executor to record region prefetch information along with 
> the list of hfiles prefetched
> 
>
> Key: HBASE-27997
> URL: https://issues.apache.org/jira/browse/HBASE-27997
> Project: HBase
>  Issue Type: Sub-task
>  Components: BucketCache
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>






[jira] [Work started] (HBASE-27998) Enhance region metrics to include prefetch ratio for each region

2023-07-28 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HBASE-27998 started by Rahul Agarkar.
-
> Enhance region metrics to include prefetch ratio for each region
> 
>
> Key: HBASE-27998
> URL: https://issues.apache.org/jira/browse/HBASE-27998
> Project: HBase
>  Issue Type: Sub-task
>  Components: regionserver
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>






[jira] [Created] (HBASE-27999) Implement cache prefetch aware load balancer

2023-07-28 Thread Rahul Agarkar (Jira)
Rahul Agarkar created HBASE-27999:
-

 Summary: Implement cache prefetch aware load balancer
 Key: HBASE-27999
 URL: https://issues.apache.org/jira/browse/HBASE-27999
 Project: HBase
  Issue Type: Sub-task
  Components: Balancer
Reporter: Rahul Agarkar
Assignee: Rahul Agarkar








[jira] [Created] (HBASE-27998) Enhance region metrics to include prefetch ratio for each region

2023-07-28 Thread Rahul Agarkar (Jira)
Rahul Agarkar created HBASE-27998:
-

 Summary: Enhance region metrics to include prefetch ratio for each 
region
 Key: HBASE-27998
 URL: https://issues.apache.org/jira/browse/HBASE-27998
 Project: HBase
  Issue Type: Sub-task
  Components: regionserver
Reporter: Rahul Agarkar








[jira] [Assigned] (HBASE-27998) Enhance region metrics to include prefetch ratio for each region

2023-07-28 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar reassigned HBASE-27998:
-

Assignee: Rahul Agarkar

> Enhance region metrics to include prefetch ratio for each region
> 
>
> Key: HBASE-27998
> URL: https://issues.apache.org/jira/browse/HBASE-27998
> Project: HBase
>  Issue Type: Sub-task
>  Components: regionserver
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>






[jira] [Created] (HBASE-27997) Enhance prefetch executor to record region prefetch information along with the list of hfiles prefetched

2023-07-28 Thread Rahul Agarkar (Jira)
Rahul Agarkar created HBASE-27997:
-

 Summary: Enhance prefetch executor to record region prefetch 
information along with the list of hfiles prefetched
 Key: HBASE-27997
 URL: https://issues.apache.org/jira/browse/HBASE-27997
 Project: HBase
  Issue Type: Sub-task
  Components: BucketCache
Reporter: Rahul Agarkar
Assignee: Rahul Agarkar








[jira] [Updated] (HBASE-27820) HBase is not starting due to Jersey library conflicts with javax.ws.rs.api jar

2023-05-25 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27820:
--
Description: 
With some recent Atlas changes that add support for HTTP-based hooks, HBase is
not starting because of conflicts between the jersey jars and the rs-api jar.

This Jira is to exclude the javax.ws.rs-api.jar from the HBase classpath.

HBase uses shaded jersey jars and hence does not need to use this jar directly.
However, it still adds this jar to the CLASSPATH while starting the server.
Atlas, on the other hand, uses a non-shaded version of the javax.ws.rs-api jar,
which causes this conflict and causes the HBase server to fail while
initializing the Atlas co-processor.

Since HBase uses the shaded jersey jar and does not use this jar directly, it
should be removed from the bundle, as it may cause similar conflicts with other
client applications potentially using it.

  was:
With some recent Atlas changes that add support for HTTP-based hooks, HBase is
not starting because of conflicts between the jersey jars and the rs-api jar.

This Jira is to exclude the javax.ws.rs-api.jar from the HBase classpath.

HBase uses shaded jersey jars and hence does not need to use this jar directly.
However, it still adds this jar to the CLASSPATH while starting the server.
Atlas, on the other hand, uses a non-shaded version of the javax.ws.rs-api jar,
which causes this conflict and causes the HBase server to fail while
initializing the Atlas co-processor.

This problem is not seen if only the HBase server is started.


> HBase is not starting due to Jersey library conflicts with javax.ws.rs.api jar
> --
>
> Key: HBASE-27820
> URL: https://issues.apache.org/jira/browse/HBASE-27820
> Project: HBase
>  Issue Type: Task
>  Components: scripts
>Reporter: Rahul Agarkar
>Assignee: Rahul Agarkar
>Priority: Major
>
> With some recent Atlas changes that add support for HTTP-based hooks, HBase
> is not starting because of conflicts between the jersey jars and the rs-api
> jar.
> This Jira is to exclude the javax.ws.rs-api.jar from the HBase classpath.
> HBase uses shaded jersey jars and hence does not need to use this jar
> directly. However, it still adds this jar to the CLASSPATH while starting the
> server. Atlas, on the other hand, uses a non-shaded version of the
> javax.ws.rs-api jar, which causes this conflict and causes the HBase server
> to fail while initializing the Atlas co-processor.
> Since HBase uses the shaded jersey jar and does not use this jar directly, it
> should be removed from the bundle, as it may cause similar conflicts with
> other client applications potentially using it.
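
To make the classpath exclusion concrete, here is a small sketch of filtering one jar out of a classpath string. It is not the change made for this Jira; the function name, the install paths, and the jar version in the example are hypothetical and serve only as an illustration.

```python
import posixpath


def exclude_jars(classpath, unwanted_prefix, sep=":"):
    """Drop classpath entries whose file name starts with unwanted_prefix."""
    kept = [entry for entry in classpath.split(sep)
            if not posixpath.basename(entry).startswith(unwanted_prefix)]
    return sep.join(kept)


# Hypothetical classpath; real entries depend on the installation.
classpath = ":".join([
    "/opt/hbase/lib/hbase-server.jar",
    "/opt/hbase/lib/javax.ws.rs-api-2.1.1.jar",
    "/opt/hbase/lib/jersey-shaded.jar",
])
print(exclude_jars(classpath, "javax.ws.rs-api"))
```

Matching on the file name prefix rather than the full path keeps the filter independent of where the jars are installed.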





[jira] [Updated] (HBASE-27820) HBase is not starting due to Jersey library conflicts with javax.ws.rs.api jar

2023-05-25 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27820:
--
Description: 
After recent Atlas changes that add HTTP-based hook support, HBase fails to 
start because of conflicts between the Jersey jars and the javax.ws.rs-api jar.

This Jira is to exclude the javax.ws.rs-api jar from the HBase classpath.

HBase uses shaded Jersey jars and hence does not need this jar directly. 
However, it still adds the jar to the CLASSPATH when starting the server. 
Atlas, on the other hand, uses a non-shaded version of the javax.ws.rs-api jar, 
which causes the conflict and makes the HBase server fail while initializing 
the Atlas coprocessor.

This problem is not seen if only the HBase server is started.

  was:
With some recent Atlas changes, HBase is not starting because of conflicts 
between jersey jars and rs-api jar.

This Jira is to exclude the javax.ws.rs-api.jar from the HBase classpath.


> HBase is not starting due to Jersey library conflicts with javax.ws.rs.api jar
> --
>
> Key: HBASE-27820
> URL: https://issues.apache.org/jira/browse/HBASE-27820
> Project: HBase
> Issue Type: Task
> Components: scripts
> Reporter: Rahul Agarkar
> Assignee: Rahul Agarkar
> Priority: Major
>
> After recent Atlas changes that add HTTP-based hook support, HBase fails to 
> start because of conflicts between the Jersey jars and the javax.ws.rs-api 
> jar.
> This Jira is to exclude the javax.ws.rs-api jar from the HBase classpath.
> HBase uses shaded Jersey jars and hence does not need this jar directly. 
> However, it still adds the jar to the CLASSPATH when starting the server. 
> Atlas, on the other hand, uses a non-shaded version of the javax.ws.rs-api 
> jar, which causes the conflict and makes the HBase server fail while 
> initializing the Atlas coprocessor.
> This problem is not seen if only the HBase server is started.





[jira] [Work started] (HBASE-27820) HBase is not starting due to Jersey library conflicts with javax.ws.rs.api jar

2023-04-28 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HBASE-27820 started by Rahul Agarkar.
-
> HBase is not starting due to Jersey library conflicts with javax.ws.rs.api jar
> --
>
> Key: HBASE-27820
> URL: https://issues.apache.org/jira/browse/HBASE-27820
> Project: HBase
> Issue Type: Task
> Components: scripts
> Reporter: Rahul Agarkar
> Assignee: Rahul Agarkar
> Priority: Major
>
> With some recent Atlas changes, HBase is not starting because of conflicts 
> between jersey jars and rs-api jar.
> This Jira is to exclude the javax.ws.rs-api.jar from the HBase classpath.





[jira] [Created] (HBASE-27820) HBase is not starting due to Jersey library conflicts with javax.ws.rs.api jar

2023-04-27 Thread Rahul Agarkar (Jira)
Rahul Agarkar created HBASE-27820:
-

 Summary: HBase is not starting due to Jersey library conflicts 
with javax.ws.rs.api jar
 Key: HBASE-27820
 URL: https://issues.apache.org/jira/browse/HBASE-27820
 Project: HBase
  Issue Type: Task
  Components: scripts
Reporter: Rahul Agarkar
Assignee: Rahul Agarkar


With some recent Atlas changes, HBase is not starting because of conflicts 
between jersey jars and rs-api jar.

This Jira is to exclude the javax.ws.rs-api.jar from the HBase classpath.





[jira] [Commented] (HBASE-26813) Remove javax.ws.rs-api dependency

2023-04-17 Thread Rahul Agarkar (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-26813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17713131#comment-17713131
 ] 

Rahul Agarkar commented on HBASE-26813:
---

Hello [~ndimiduk], do we have any plans to fix this issue? I was looking at a 
similar issue and tried a simple fix where I modified the bin/hbase script to 
filter out the javax.ws.rs-api* jars while adding the jar files to the 
CLASSPATH. This seems to work for me, although I am not sure if this is the 
right approach.

Thanks.
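
The filtering idea described in the comment above can be illustrated with a short sketch. This is not the actual change to bin/hbase (which is a shell script); the function name, the glob pattern, and the sample paths are assumptions chosen for illustration.

```python
import fnmatch
import posixpath

# Assumed glob for the jar to exclude; the real jar name carries a version suffix.
EXCLUDE_PATTERN = "javax.ws.rs-api*.jar"


def filter_classpath(classpath, pattern=EXCLUDE_PATTERN, sep=":"):
    """Drop classpath entries whose file name matches the pattern.

    Entries are kept in their original order; empty entries are discarded.
    """
    kept = [
        entry
        for entry in classpath.split(sep)
        if entry and not fnmatch.fnmatchcase(posixpath.basename(entry), pattern)
    ]
    return sep.join(kept)


# Hypothetical classpath for illustration only.
example = "/opt/hbase/lib/hbase-server.jar:/opt/hbase/lib/javax.ws.rs-api-2.1.1.jar:/opt/hbase/conf"
filtered = filter_classpath(example)
```

Matching on the file name rather than the full path keeps the filter independent of where the lib directory is installed.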

> Remove javax.ws.rs-api dependency
> -
>
> Key: HBASE-26813
> URL: https://issues.apache.org/jira/browse/HBASE-26813
> Project: HBase
> Issue Type: Sub-task
> Affects Versions: 2.5.0, 2.6.0, 3.0.0-alpha-3
> Reporter: Nick Dimiduk
> Assignee: Nick Dimiduk
> Priority: Major
>
> I see that we still have {{javax.ws.rs-api}} as a dependency in our 
> hbase-http pom. If, for example, the {{ClientBuilder}} from this jar is used, 
> it'll instantiate whatever non-shaded jersey client is on the class path.





[jira] [Commented] (HBASE-27389) Add cost function in balancer to consider the cost of building bucket cache before moving regions

2023-04-02 Thread Rahul Agarkar (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-27389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17707765#comment-17707765
 ] 

Rahul Agarkar commented on HBASE-27389:
---

The pull request has been updated with the following:
 # Added system tests
 # Fixed issues seen while testing the changes

> Add cost function in balancer to consider the cost of building bucket cache 
> before moving regions
> -
>
> Key: HBASE-27389
> URL: https://issues.apache.org/jira/browse/HBASE-27389
> Project: HBase
> Issue Type: Task
> Components: Balancer
> Reporter: Rahul Agarkar
> Assignee: Rahul Agarkar
> Priority: Major
>
> HBase currently uses StochasticLoadBalancer to determine the cost of moving 
> regions from one RS to another. Each cost function gives a result between 
> 0 and 1, with 0 being the lowest cost and 1 being the highest cost. The 
> balancer iterates through each cost function and comes up with the total 
> cost. The balancer then creates multiple balancing plans from random actions 
> and computes the cost of each plan as if it were executed; if the cost of a 
> plan is less than the initial cost, the plan is executed.
> Implement a new "PrefetchCacheCostFunction" which takes into account whether 
> the region is fully cached and returns the highest cost if the plan suggests 
> moving this region.





[jira] [Work started] (HBASE-27389) Add cost function in balancer to consider the cost of building bucket cache before moving regions

2023-01-29 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HBASE-27389 started by Rahul Agarkar.
-
> Add cost function in balancer to consider the cost of building bucket cache 
> before moving regions
> -
>
> Key: HBASE-27389
> URL: https://issues.apache.org/jira/browse/HBASE-27389
> Project: HBase
> Issue Type: Task
> Components: Balancer
> Reporter: Rahul Agarkar
> Assignee: Rahul Agarkar
> Priority: Major
>
> HBase currently uses StochasticLoadBalancer to determine the cost of moving 
> regions from one RS to another. Each cost function gives a result between 
> 0 and 1, with 0 being the lowest cost and 1 being the highest cost. The 
> balancer iterates through each cost function and comes up with the total 
> cost. The balancer then creates multiple balancing plans from random actions 
> and computes the cost of each plan as if it were executed; if the cost of a 
> plan is less than the initial cost, the plan is executed.
> Implement a new "PrefetchCacheCostFunction" which takes into account whether 
> the region is fully cached and returns the highest cost if the plan suggests 
> moving this region.





[jira] [Updated] (HBASE-27389) Add cost function in balancer to consider the cost of building bucket cache before moving regions

2023-01-29 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27389:
--
Status: Patch Available  (was: In Progress)

> Add cost function in balancer to consider the cost of building bucket cache 
> before moving regions
> -
>
> Key: HBASE-27389
> URL: https://issues.apache.org/jira/browse/HBASE-27389
> Project: HBase
> Issue Type: Task
> Components: Balancer
> Reporter: Rahul Agarkar
> Assignee: Rahul Agarkar
> Priority: Major
>
> HBase currently uses StochasticLoadBalancer to determine the cost of moving 
> regions from one RS to another. Each cost function gives a result between 
> 0 and 1, with 0 being the lowest cost and 1 being the highest cost. The 
> balancer iterates through each cost function and comes up with the total 
> cost. The balancer then creates multiple balancing plans from random actions 
> and computes the cost of each plan as if it were executed; if the cost of a 
> plan is less than the initial cost, the plan is executed.
> Implement a new "PrefetchCacheCostFunction" which takes into account whether 
> the region is fully cached and returns the highest cost if the plan suggests 
> moving this region.





[jira] [Updated] (HBASE-27389) Add cost function in balancer to consider the cost of building bucket cache before moving regions

2022-09-23 Thread Rahul Agarkar (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rahul Agarkar updated HBASE-27389:
--

I am working on this and will raise a PR soon.

> Add cost function in balancer to consider the cost of building bucket cache 
> before moving regions
> -
>
> Key: HBASE-27389
> URL: https://issues.apache.org/jira/browse/HBASE-27389
> Project: HBase
> Issue Type: Task
> Components: Balancer
> Reporter: Rahul Agarkar
> Priority: Major
>
> HBase currently uses StochasticLoadBalancer to determine the cost of moving 
> regions from one RS to another. Each cost function gives a result between 
> 0 and 1, with 0 being the lowest cost and 1 being the highest cost. The 
> balancer iterates through each cost function and comes up with the total 
> cost. The balancer then creates multiple balancing plans from random actions 
> and computes the cost of each plan as if it were executed; if the cost of a 
> plan is less than the initial cost, the plan is executed.
> Implement a new "PrefetchCacheCostFunction" which takes into account whether 
> the region is fully cached and returns the highest cost if the plan suggests 
> moving this region.





[jira] [Created] (HBASE-27389) Add cost function in balancer to consider the cost of building bucket cache before moving regions

2022-09-23 Thread Rahul Agarkar (Jira)
Rahul Agarkar created HBASE-27389:
-

 Summary: Add cost function in balancer to consider the cost of 
building bucket cache before moving regions
 Key: HBASE-27389
 URL: https://issues.apache.org/jira/browse/HBASE-27389
 Project: HBase
  Issue Type: Task
  Components: Balancer
Reporter: Rahul Agarkar


HBase currently uses StochasticLoadBalancer to determine the cost of moving 
regions from one RS to another. Each cost function gives a result between 0 and 
1, with 0 being the lowest cost and 1 being the highest cost. The balancer 
iterates through each cost function and comes up with the total cost. The 
balancer then creates multiple balancing plans from random actions and computes 
the cost of each plan as if it were executed; if the cost of a plan is less 
than the initial cost, the plan is executed.
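
The loop described above can be sketched roughly as follows. This is a simplified illustration and not the StochasticLoadBalancer code: the function names, the weighted-sum form of the total cost, the single-region "move" action, and the region-count skew function are all assumptions made for the example.

```python
import random


def total_cost(assignment, cost_functions):
    """Weighted sum of the individual costs; each function returns a value in [0, 1]."""
    return sum(weight * fn(assignment) for fn, weight in cost_functions)


def region_count_skew(servers):
    """Hypothetical cost function: 0.0 when regions are spread evenly,
    1.0 when every region sits on a single server."""
    def cost(assignment):
        counts = {s: 0 for s in servers}
        for server in assignment.values():
            counts[server] += 1
        return (max(counts.values()) - min(counts.values())) / max(len(assignment), 1)
    return cost


def balance(assignment, servers, cost_functions, steps=1000, seed=0):
    """Try random single-region moves and keep a move only if it lowers the total cost."""
    rng = random.Random(seed)
    current = dict(assignment)
    current_cost = total_cost(current, cost_functions)
    for _ in range(steps):
        region = rng.choice(sorted(current))
        target = rng.choice(servers)
        if target == current[region]:
            continue  # not a move
        candidate = dict(current)
        candidate[region] = target
        candidate_cost = total_cost(candidate, cost_functions)
        if candidate_cost < current_cost:
            current, current_cost = candidate, candidate_cost
    return current, current_cost


servers = ["rs1", "rs2"]
start = {"r1": "rs1", "r2": "rs1", "r3": "rs1", "r4": "rs1"}
functions = [(region_count_skew(servers), 1.0)]
plan, plan_cost = balance(start, servers, functions)
```

Because only cost-reducing moves are kept, the resulting plan never costs more than the starting assignment.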

Implement a new "PrefetchCacheCostFunction" which takes into account whether 
the region is fully cached and returns the highest cost if the plan suggests 
moving this region.
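
A minimal sketch of the proposed behaviour, assuming the set of fully cached regions is already known: the function below returns the highest cost (1.0) when a plan moves any such region and the lowest cost (0.0) otherwise. The name, signature, and all-or-nothing scoring are illustrative assumptions, not the eventual implementation.

```python
def prefetch_cache_cost(initial, proposed, fully_cached):
    """Return 1.0 if the proposed assignment moves a fully cached region, else 0.0.

    initial and proposed map region name to server name;
    fully_cached is the set of regions whose data is entirely in the bucket cache.
    """
    for region, server in proposed.items():
        if region in fully_cached and initial.get(region) != server:
            return 1.0
    return 0.0


initial = {"r1": "rs1", "r2": "rs1"}
cached = {"r1"}
cost_moving_uncached = prefetch_cache_cost(initial, {"r1": "rs1", "r2": "rs2"}, cached)
cost_moving_cached = prefetch_cache_cost(initial, {"r1": "rs2", "r2": "rs1"}, cached)
```

With this scoring, a plan that relocates a fully cached region gets the maximum cost from this function, which steers the balancer toward plans that leave warmed caches in place.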


