-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA256 Hi folks,
I'm working on a new parser to replace SearchQueryParenthesizedParser that understands nested parentheses. The SearchQueryParenthesizedParser has some unit tests that live in cds-invenio/modules/websearch/lib/search_engine_query_parser_tests.py. At least one of those tests checks on the behavior of the parser when dealing with quotes. It puts in this query string: "(expr1) - expr2 | 'expressions - in + quotes | (are) not - parsed ' - (expr3) | expr4" and expects back this result: ['+', 'expr1', '-', "expr2 | 'expressions - in + quotes | (are) not - parsed '", '-', 'expr3', '|', 'expr4'] I contend that the behavior of (the existing) SQPP is actually incorrect in this case. I think the result should be ['+', 'expr1', '-', 'expr2', '|', '+', "'expressions - in + quotes | (are) not - parsed '", '-', 'expr3', '|', 'expr4'] Does this look as bizarre to anyone else as it does to me? Joe -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iQIcBAEBCAAGBQJL4IuOAAoJEGh+D3e/PaCrJpsP/iE/sY6aGT72DyT6y9x4tv6+ yEWwBjeSxU4iYGUAnpPJTuMbMQImjG4gDWDSq19rNGkUhdXdccDgWPTQbzZMqOqD SZSkDpQe5lPChc162+4avTkoa4szwMnEuQivTKb67bj4nRSB0cG6kxCtyyovbjEv pbbaayZNbj+21wSTDCtfzZMNqtBjx7WSuRZDrj3NrgEp0i61eXdzqwYA7jmAJ2zq PbRJPyDXH/C4Ck8HP84FAKtTg+hYN+hKC/sj5PS1KIucFJBS/JXD+Z05nI71VumJ gWMqnP1JoXWUnqF6cT85ChhVrrzaXs8lrZ6SZTSSKjwnRWtvFRcLb+VIrchwLxvv gGuZbkq67ZJMQliO++/syhjvWd9cLlvehpAIS521YjnCJI+wBlz+Lf88ASkLWcwq 0gwaL+5ufWNBKksig+8REU8MVqHfj912xihOwGgWbmFzbWqNq8j3zyg8enzj/04e d+vKU6/vabtOulVxxjrz4TgvZEdD4mGJpUawhJXeHikX5WMTrIjvMLS3AxaHHFzg BjKOIQfRgfHyFrxI8VpUiGFnsoqVaKctecavq2HuWq0vf3LfX2v/ZR8TNYMEaSNj Qq0ONWt/QjnIQTw2PWu2UZqaQW4Rk6D3TrpmkMxPOc9bSFLfqbWNS+GOgbglQ9Wi CI94nTlBg0kTPT/yWvxS =g1Bu -----END PGP SIGNATURE-----
