[
https://issues.apache.org/jira/browse/SOLR-12514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mahesh Kumar Vasanthu Somashekar updated SOLR-12514:
----------------------------------------------------
Description:
Solr serves client requests going throught 3 steps - init(), authorize() and
handle-request ([link
git-link|https://github.com/apache/lucene-solr/blob/releases/lucene-solr/7.3.1/solr/core/src/java/org/apache/solr/servlet/HttpSolrCall.java#L471]).
init() initializes all required information to be used by authorize(). init()
skips initializing if request is to be served remotely, which leads to skipping
authorization step ([link
git-link|https://github.com/apache/lucene-solr/blob/releases/lucene-solr/7.3.1/solr/core/src/java/org/apache/solr/servlet/HttpSolrCall.java#L291]).
init() relies on 'cores' object which only has information of local node
(which is perfect as per design). It should actually be getting security
information (security.json) from zookeeper, which has global view of the
cluster.
Example:
SolrCloud setup consists of 2 nodes (solr-7.3.1):
live_nodes: [
"localhost:8983_solr",
"localhost:8984_solr",
]
Two collections are created - 'collection-rf-1' with RF=1 and 'collection-rf-2'
with RF=2.
Two users are created - 'collection-rf-1-user' and 'collection-rf-2-user'.
Security configuration is as below (security.json attached):
"authorization":{
"class":"solr.RuleBasedAuthorizationPlugin",
"permissions":[
{ "name":"read", "collection":"collection-rf-2", "role":"collection-rf-2",
"index":1}
,
{ "name":"read", "collection":"collection-rf-1", "role":"collection-rf-1",
"index":2}
,
{ "name":"read", "role":"*", "index":3}
,
...
"user-role":
{ "collection-rf-1-user":[ "collection-rf-1"], "collection-rf-2-user":[
"collection-rf-2"]}
,
...
Basically, its setup to that 'collection-rf-1-user' user can only access
'collection-rf-1' collection and 'collection-rf-2-user' user can only access
'collection-rf-2' collection.
Also note that 'collection-rf-1' collection replica is only on
'localhost:8983_solr' node, whereas ''collection-rf-2' collection replica is on
both live nodes.
Authorization does not work as expected for 'collection-rf-1' collection:
$ curl -u collection-rf-2-user:password
'http://*localhost:8983*/solr/collection-rf-1/select?q=*:*'
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
<title>Error 403 Unauthorized request, Response code: 403</title>
</head>
<body><h2>HTTP ERROR 403</h2>
<p>Problem accessing /solr/collection-rf-1/select. Reason:
<pre> Unauthorized request, Response code: 403</pre></p>
</body>
</html>
$ curl -u collection-rf-2-user:password
'http://*localhost:8984*/solr/collection-rf-1/select?q=*:*'
{
"responseHeader":{
"zkConnected":true,
"status":0,
"QTime":0,
"params":{
"q":"*:*"}},
"response":{"numFound":0,"start":0,"docs":[]
}}
Whereas authorization works perfectly for 'collection-rf-2' collection (as both
nodes have replica):
$ curl -u collection-rf-1-user:password
'http://*localhost:8984*/solr/collection-rf-2/select?q=*:*'
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
<title>Error 403 Unauthorized request, Response code: 403</title>
</head>
<body><h2>HTTP ERROR 403</h2>
<p>Problem accessing /solr/collection-rf-2/select. Reason:
<pre> Unauthorized request, Response code: 403</pre></p>
</body>
</html>
$ curl -u collection-rf-1-user:password
'http://*localhost:8983*/solr/collection-rf-2/select?q=*:*'
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
<title>Error 403 Unauthorized request, Response code: 403</title>
</head>
<body><h2>HTTP ERROR 403</h2>
<p>Problem accessing /solr/collection-rf-2/select. Reason:
<pre> Unauthorized request, Response code: 403</pre></p>
</body>
</html>
was:
Solr serves client requests going throught 3 steps - init(), authorize() and
handle-request.
init() initializes all required information to be used by authorize(). init()
skips initializing if request is to be served remotely, which leads to skipping
authorization step.
init() relies on 'cores' object which only has information of local node (which
is perfect as per design). It should actually be getting security information
(security.json) from zookeeper, which has global view of the cluster.
Example:
SolrCloud setup consists of 2 nodes (solr-7.3.1):
live_nodes: [
"localhost:8983_solr",
"localhost:8984_solr",
]
Two collections are created - 'collection-rf-1' with RF=1 and 'collection-rf-2'
with RF=2.
Two users are created - 'collection-rf-1-user' and 'collection-rf-2-user'.
Security configuration is as below (security.json attached):
"authorization":{
"class":"solr.RuleBasedAuthorizationPlugin",
"permissions":[{
"name":"read",
"collection":"collection-rf-2",
"role":"collection-rf-2",
"index":1},
{
"name":"read",
"collection":"collection-rf-1",
"role":"collection-rf-1",
"index":2},
{
"name":"read",
"role":"*",
"index":3},
...
"user-role":{
"collection-rf-1-user":[
"collection-rf-1"],
"collection-rf-2-user":[
"collection-rf-2"]},
...
Basically, its setup to that 'collection-rf-1-user' user can only access
'collection-rf-1' collection and 'collection-rf-2-user' user can only access
'collection-rf-2' collection.
Also note that 'collection-rf-1' collection replica is only on
'localhost:8983_solr' node, whereas ''collection-rf-2' collection replica is on
both live nodes.
Authorization does not work as expected for 'collection-rf-1' collection:
$ curl -u collection-rf-2-user:password
'http://*localhost:8983*/solr/collection-rf-1/select?q=*:*'
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
<title>Error 403 Unauthorized request, Response code: 403</title>
</head>
<body><h2>HTTP ERROR 403</h2>
<p>Problem accessing /solr/collection-rf-1/select. Reason:
<pre> Unauthorized request, Response code: 403</pre></p>
</body>
</html>
$ curl -u collection-rf-2-user:password
'http://*localhost:8984*/solr/collection-rf-1/select?q=*:*'
{
"responseHeader":{
"zkConnected":true,
"status":0,
"QTime":0,
"params":{
"q":"*:*"}},
"response":{"numFound":0,"start":0,"docs":[]
}}
Whereas authorization works perfectly for 'collection-rf-2' collection (as both
nodes have replica):
$ curl -u collection-rf-1-user:password
'http://*localhost:8984*/solr/collection-rf-2/select?q=*:*'
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
<title>Error 403 Unauthorized request, Response code: 403</title>
</head>
<body><h2>HTTP ERROR 403</h2>
<p>Problem accessing /solr/collection-rf-2/select. Reason:
<pre> Unauthorized request, Response code: 403</pre></p>
</body>
</html>
$ curl -u collection-rf-1-user:password
'http://*localhost:8983*/solr/collection-rf-2/select?q=*:*'
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
<title>Error 403 Unauthorized request, Response code: 403</title>
</head>
<body><h2>HTTP ERROR 403</h2>
<p>Problem accessing /solr/collection-rf-2/select. Reason:
<pre> Unauthorized request, Response code: 403</pre></p>
</body>
</html>
> Rule-base Authorization plugin skips authorization if querying node does not
> have collection replica
> ----------------------------------------------------------------------------------------------------
>
> Key: SOLR-12514
> URL: https://issues.apache.org/jira/browse/SOLR-12514
> Project: Solr
> Issue Type: Bug
> Security Level: Public(Default Security Level. Issues are Public)
> Components: security
> Affects Versions: 7.3.1
> Reporter: Mahesh Kumar Vasanthu Somashekar
> Priority: Major
> Attachments: Screen Shot 2018-06-24 at 9.36.45 PM.png, security.json
>
>
> Solr serves client requests going throught 3 steps - init(), authorize() and
> handle-request ([link
> git-link|https://github.com/apache/lucene-solr/blob/releases/lucene-solr/7.3.1/solr/core/src/java/org/apache/solr/servlet/HttpSolrCall.java#L471]).
> init() initializes all required information to be used by authorize().
> init() skips initializing if request is to be served remotely, which leads to
> skipping authorization step ([link
> git-link|https://github.com/apache/lucene-solr/blob/releases/lucene-solr/7.3.1/solr/core/src/java/org/apache/solr/servlet/HttpSolrCall.java#L291]).
> init() relies on 'cores' object which only has information of local node
> (which is perfect as per design). It should actually be getting security
> information (security.json) from zookeeper, which has global view of the
> cluster.
>
> Example:
> SolrCloud setup consists of 2 nodes (solr-7.3.1):
> live_nodes: [
> "localhost:8983_solr",
> "localhost:8984_solr",
> ]
> Two collections are created - 'collection-rf-1' with RF=1 and
> 'collection-rf-2' with RF=2.
> Two users are created - 'collection-rf-1-user' and 'collection-rf-2-user'.
> Security configuration is as below (security.json attached):
> "authorization":{
> "class":"solr.RuleBasedAuthorizationPlugin",
> "permissions":[
> { "name":"read", "collection":"collection-rf-2", "role":"collection-rf-2",
> "index":1}
> ,
> { "name":"read", "collection":"collection-rf-1", "role":"collection-rf-1",
> "index":2}
> ,
> { "name":"read", "role":"*", "index":3}
> ,
> ...
> "user-role":
> { "collection-rf-1-user":[ "collection-rf-1"], "collection-rf-2-user":[
> "collection-rf-2"]}
> ,
> ...
>
> Basically, its setup to that 'collection-rf-1-user' user can only access
> 'collection-rf-1' collection and 'collection-rf-2-user' user can only access
> 'collection-rf-2' collection.
> Also note that 'collection-rf-1' collection replica is only on
> 'localhost:8983_solr' node, whereas ''collection-rf-2' collection replica is
> on both live nodes.
>
> Authorization does not work as expected for 'collection-rf-1' collection:
> $ curl -u collection-rf-2-user:password
> 'http://*localhost:8983*/solr/collection-rf-1/select?q=*:*'
> <html>
> <head>
> <meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
> <title>Error 403 Unauthorized request, Response code: 403</title>
> </head>
> <body><h2>HTTP ERROR 403</h2>
> <p>Problem accessing /solr/collection-rf-1/select. Reason:
> <pre> Unauthorized request, Response code: 403</pre></p>
> </body>
> </html>
> $ curl -u collection-rf-2-user:password
> 'http://*localhost:8984*/solr/collection-rf-1/select?q=*:*'
> {
> "responseHeader":{
> "zkConnected":true,
> "status":0,
> "QTime":0,
> "params":{
> "q":"*:*"}},
> "response":{"numFound":0,"start":0,"docs":[]
> }}
>
> Whereas authorization works perfectly for 'collection-rf-2' collection (as
> both nodes have replica):
> $ curl -u collection-rf-1-user:password
> 'http://*localhost:8984*/solr/collection-rf-2/select?q=*:*'
> <html>
> <head>
> <meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
> <title>Error 403 Unauthorized request, Response code: 403</title>
> </head>
> <body><h2>HTTP ERROR 403</h2>
> <p>Problem accessing /solr/collection-rf-2/select. Reason:
> <pre> Unauthorized request, Response code: 403</pre></p>
> </body>
> </html>
> $ curl -u collection-rf-1-user:password
> 'http://*localhost:8983*/solr/collection-rf-2/select?q=*:*'
> <html>
> <head>
> <meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
> <title>Error 403 Unauthorized request, Response code: 403</title>
> </head>
> <body><h2>HTTP ERROR 403</h2>
> <p>Problem accessing /solr/collection-rf-2/select. Reason:
> <pre> Unauthorized request, Response code: 403</pre></p>
> </body>
> </html>
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]