[GitHub] [couchdb] nickva opened a new pull request #3599: Improve basic auth credentials handling in replicator

GitBox Thu, 03 Jun 2021 11:33:15 -0700


nickva opened a new pull request #3599:
URL: https://github.com/apache/couchdb/pull/3599



   This is a port of commit ecd266b0e87f44e1080cabdb4c28e4758f5a4406 from 3.x 
to main. Including the same commit message from there for completeness below. 
Then, towards the end, there is a description of changes required to port this 
PR to main.
   
   Previously, there were two ways to pass in basic auth credentials for 
endpoints -- using URL's userinfo part, and encoding them in an 
`"Authorization": "basic..."` header. Neither one is ideal for these reasons:
   
    * Passwords in userinfo don't allow using ":", "@" and other characters. 
However, even after switching to always unquoting them like we did recently 
[1], it could break authentication for usernames or passwords previously
      containing "+" or "%HH" patterns, as "+" might now be decoded to a " ".
   
    * Base64 encoded headers need an extra step to encode them. Also, quite 
often  these encoded headers are confused as being "encrypted" and shared in a 
clear channel.
   
   To improve this, revert the recent commit to unquote URL userinfo parts to 
restore backwards compatibility, and introduce a way to pass in basic auth 
credentials in the "auth" object. The "auth" object was already added a while
   back to allow authentication plugins to store their credentials in it. The 
format is:
   ```
      "source": {
          "url": "https://host/db";,
          "auth": {
              "basic": {
                  "username":"myuser",
                  "password":"mypassword"
              }
          }
      }
   ```
   
   {"auth" : "basic" : {...}} object is checked first, and if credentials are 
provided, they will be used. If they are not then userinfo and basic auth 
header will be parsed.
   
   Internally, there was a good amount duplication related to parsing 
credentials from userinfo and headers in replication ID generation logic and in 
the auth session plugin. As a cleanup, consolidate that logic in the 
`couch_replicator_parse` module.
   
   The commit is quite different from the 3.x one for these two reasons:
   
    * `main` uses two types of replication endpoint "objects": `#httpdb` 
records and `HttpDb` maps. In most cases it uses maps which can be serialized 
and deserialized to and from json. But in lower level, connection handling code
      in couch_replicator_httpc, it uses `#httpdb` records. This explain the 
need to still handle both representations. Auth session plugin, for instance, 
uses the lower level #httpdb records while replicator ID handling code uses the 
map based one.
   
    * `main` has all the parsing of replication documents and `_replicate` 
request bodies in a separate `couch_replicator_parse`. So, most of the code 
which handles normalizing basic auth creds is there instead of   
`couch_replicator_docs` or `couch_replicator_utils` like it is in 3.x
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [couchdb] nickva opened a new pull request #3599: Improve basic auth credentials handling in replicator

Reply via email to