Re: [PATCH v2] docs: automarkup.py: Allow automatic cross-reference inside C namespace

2020-11-30 Thread Jonathan Corbet
On Tue, 17 Nov 2020 02:12:01 +
Nícolas F. R. A. Prado  wrote:

> Sphinx 3.1 introduced namespaces for C cross-references. With this,
> each C domain type/function declaration is put inside the namespace that
> was active at the time of its declaration.
> 
> Add support for automatic cross-referencing inside C namespaces by
> checking whether the corresponding source file had a C namespace Sphinx
> directive, and if so, try cross-referencing inside of it before going to
> the global scope.
> 
> This assumes there's only one namespace (if any) per rst file.
> 
> Signed-off-by: Nícolas F. R. A. Prado 
> ---
> 
> To those following from v1:
> 
> I ended up doing the simplest solution possible, which is to just directly 
> read
> the rst source corresponding to the doc page right before doing the 
> automarkup.
> It's not very efficient in the sense that the source is being read
> twice (first by Sphinx, then by this), but it sidesteps the "data sharing
> between processes" issue, so parallel_read_safe can be reenabled, and I didn't
> notice any performance hit from this patch (as opposed to the big hit from 
> v1).
> Works with both Sphinx 2 and 3.

OK, I've (finally) applied this, thanks.  It does indeed seem to work.
Still hoping for something more elegant someday...:)

Thanks,

jon


Re: [PATCH v2] docs: automarkup.py: Allow automatic cross-reference inside C namespace

2020-11-18 Thread Nícolas F . R . A . Prado
On Wed Nov 18, 2020 at 5:37 PM -03, Jonathan Corbet wrote:
>
> On Tue, 17 Nov 2020 02:12:01 +
> Nícolas F. R. A. Prado  wrote:
>
> > Sphinx 3.1 introduced namespaces for C cross-references. With this,
> > each C domain type/function declaration is put inside the namespace that
> > was active at the time of its declaration.
> >
> > Add support for automatic cross-referencing inside C namespaces by
> > checking whether the corresponding source file had a C namespace Sphinx
> > directive, and if so, try cross-referencing inside of it before going to
> > the global scope.
> >
> > This assumes there's only one namespace (if any) per rst file.
> >
> > Signed-off-by: Nícolas F. R. A. Prado 
> > ---
> >
> > To those following from v1:
> >
> > I ended up doing the simplest solution possible, which is to just directly 
> > read
> > the rst source corresponding to the doc page right before doing the 
> > automarkup.
> > It's not very efficient in the sense that the source is being read
> > twice (first by Sphinx, then by this), but it sidesteps the "data sharing
> > between processes" issue, so parallel_read_safe can be reenabled, and I 
> > didn't
> > notice any performance hit from this patch (as opposed to the big hit from 
> > v1).
> > Works with both Sphinx 2 and 3.
>
> The solution does lack elegance, but it is a solution, which is more than
> we had before :)

Exactly :P

> That said, rather than re-opening and re-reading the
> file, why not just connect to the source-read event, which will happily
> hand you the document source that it has already read?

Unfortunately that wouldn't work. What would happen is that Sphinx would spawn a
python process to handle the source-read event for file X, then later it would
spawn another python process to handle doctree-resolved for the same file X.
Being two different processes, data can't easily be shared between them, which
is why I originally disabled parallel_read_safe, to force everything into a
single process, enabling data to be stored in the source-read phase to be later
used at the doctree-resolved phase.

What we need is a single process that both reads the source and uses that info
to make the auto markup. With parallel_read_safe enabled, that is only possible
by doing everything in a single Sphinx event (namely, doctree-resolved), and
therefore the read needs to be done manually outside of Sphinx.

Thanks,
Nícolas

>
> Thanks,
>
> jon



Re: [PATCH v2] docs: automarkup.py: Allow automatic cross-reference inside C namespace

2020-11-18 Thread Jonathan Corbet
On Tue, 17 Nov 2020 12:30:13 +
Nícolas F. R. A. Prado  wrote:

> > Hmm... do we still need to skip syscalls?  
> 
> Yeah, I see what you mean. Since you moved the syscalls in the docs inside
> namespaces, there shouldn't be any syscall definitions in the global scope
> anymore and therefore we don't need to skip them any longer.
> 
> I tried it out here and indeed it works fine without skipping them.
> 
> But I wonder if it would be a good safety measure to leave it there anyway. We
> never want to cross-reference to syscalls in the global scope, so if we 
> continue
> doing that skip, even if someone accidentally adds a syscall definition 
> outside
> a c:namespace, this will prevent cross-references to it anyway.
> 
> What do you think?

I put the original skip logic in there to keep it from even trying to
cross-reference common syscall names; I wasn't really even worried about
false references at that point.  I'd leave the check in unless it's
actively causing trouble somewhere...

Thanks,

jon


Re: [PATCH v2] docs: automarkup.py: Allow automatic cross-reference inside C namespace

2020-11-18 Thread Jonathan Corbet
On Tue, 17 Nov 2020 02:12:01 +
Nícolas F. R. A. Prado  wrote:

> Sphinx 3.1 introduced namespaces for C cross-references. With this,
> each C domain type/function declaration is put inside the namespace that
> was active at the time of its declaration.
> 
> Add support for automatic cross-referencing inside C namespaces by
> checking whether the corresponding source file had a C namespace Sphinx
> directive, and if so, try cross-referencing inside of it before going to
> the global scope.
> 
> This assumes there's only one namespace (if any) per rst file.
> 
> Signed-off-by: Nícolas F. R. A. Prado 
> ---
> 
> To those following from v1:
> 
> I ended up doing the simplest solution possible, which is to just directly 
> read
> the rst source corresponding to the doc page right before doing the 
> automarkup.
> It's not very efficient in the sense that the source is being read
> twice (first by Sphinx, then by this), but it sidesteps the "data sharing
> between processes" issue, so parallel_read_safe can be reenabled, and I didn't
> notice any performance hit from this patch (as opposed to the big hit from 
> v1).
> Works with both Sphinx 2 and 3.

The solution does lack elegance, but it is a solution, which is more than
we had before :)  That said, rather than re-opening and re-reading the
file, why not just connect to the source-read event, which will happily
hand you the document source that it has already read?

Thanks,

jon


Re: [PATCH v2] docs: automarkup.py: Allow automatic cross-reference inside C namespace

2020-11-17 Thread Nícolas F . R . A . Prado
On Tue Nov 17, 2020 at 2:47 AM -03, Mauro Carvalho Chehab wrote:
>
> Em Tue, 17 Nov 2020 02:12:01 +
> Nícolas F. R. A. Prado  escreveu:
>
> > Sphinx 3.1 introduced namespaces for C cross-references. With this,
> > each C domain type/function declaration is put inside the namespace that
> > was active at the time of its declaration.
> >
> > Add support for automatic cross-referencing inside C namespaces by
> > checking whether the corresponding source file had a C namespace Sphinx
> > directive, and if so, try cross-referencing inside of it before going to
> > the global scope.
> >
> > This assumes there's only one namespace (if any) per rst file.
> >
> > Signed-off-by: Nícolas F. R. A. Prado 
> > ---
> >
> > To those following from v1:
> >
> > I ended up doing the simplest solution possible, which is to just directly 
> > read
> > the rst source corresponding to the doc page right before doing the 
> > automarkup.
> > It's not very efficient in the sense that the source is being read
> > twice (first by Sphinx, then by this), but it sidesteps the "data sharing
> > between processes" issue, so parallel_read_safe can be reenabled, and I 
> > didn't
> > notice any performance hit from this patch (as opposed to the big hit from 
> > v1).
> > Works with both Sphinx 2 and 3.
> >
> > Changes in v2:
> > - Get C namespace from reading the corresponding source at the time of doing
> >   automarkup instead of storing all namespaces beforehand at the source-read
> >   phase
> > - Add get_c_namespace()
> > - Remove save_c_namespace()
> > - Reenabled Sphinx's parallel_read_safe
> >
> > v1: 
> > https://lore.kernel.org/linux-doc/20201013231218.2750109-6-nfrapr...@protonmail.com/
> >
> >  Documentation/sphinx/automarkup.py | 122 ++---
> >  1 file changed, 75 insertions(+), 47 deletions(-)
> >
> > diff --git a/Documentation/sphinx/automarkup.py 
> > b/Documentation/sphinx/automarkup.py
> > index 3e81ebab26ed..953b24b6e2b4 100644
> > --- a/Documentation/sphinx/automarkup.py
> > +++ b/Documentation/sphinx/automarkup.py
> > @@ -53,6 +53,8 @@ RE_typedef = re.compile(r'\b(typedef)\s+([a-zA-Z_]\w+)', 
> > flags=ascii_p3)
> >  #
> >  RE_doc = re.compile(r'\bDocumentation(/[\w\-_/]+)(\.\w+)*')
> >
> > +RE_namespace = re.compile(r'^\s*..\s*c:namespace::\s*(\S+)\s*$')
> > +
> >  #
> >  # Reserved C words that we should skip when cross-referencing
> >  #
> > @@ -70,6 +72,8 @@ Skipfuncs = [ 'open', 'close', 'read', 'write', 'fcntl', 
> > 'mmap',
> >'select', 'poll', 'fork', 'execve', 'clone', 'ioctl',
> >'socket' ]
> >
> > +c_namespace = ''
> > +
> >  def markup_refs(docname, app, node):
> >  t = node.astext()
> >  done = 0
> > @@ -128,30 +132,38 @@ def markup_func_ref_sphinx3(docname, app, match):
> >  #
> >  # Go through the dance of getting an xref out of the C domain
> >  #
> > -target = match.group(2)
> > +base_target = match.group(2)
> >  target_text = nodes.Text(match.group(0))
> >  xref = None
> > -if not (target in Skipfuncs or target in Skipnames):
> > -for class_s, reftype_s in zip(class_str, reftype_str):
> > -lit_text = nodes.literal(classes=['xref', 'c', class_s])
> > -lit_text += target_text
> > -pxref = addnodes.pending_xref('', refdomain = 'c',
> > -  reftype = reftype_s,
> > -  reftarget = target, modname = 
> > None,
> > -  classname = None)
> > -#
> > -# XXX The Latex builder will throw NoUri exceptions here,
> > -# work around that by ignoring them.
> > -#
> > -try:
> > -xref = cdom.resolve_xref(app.env, docname, app.builder,
> > - reftype_s, target, pxref,
> > - lit_text)
> > -except NoUri:
> > -xref = None
> > +possible_targets = [base_target]
> > +# Check if this document has a namespace, and if so, try
> > +# cross-referencing inside it first.
> > +if c_namespace:
> > +possible_targets.insert(0, c_namespace + "." + base_target)
> >
> > -if xref:
> > -return xref
> > +if base_target not in Skipnames:
> > +for target in possible_targets:
> > +if target not in Skipfuncs:
>
> Hmm... do we still need to skip syscalls?

Yeah, I see what you mean. Since you moved the syscalls in the docs inside
namespaces, there shouldn't be any syscall definitions in the global scope
anymore and therefore we don't need to skip them any longer.

I tried it out here and indeed it works fine without skipping them.

But I wonder if it would be a good safety measure to leave it there anyway. We
never want to cross-reference to syscalls in the global scope, so if we continue
doing that skip, even if someone 

Re: [PATCH v2] docs: automarkup.py: Allow automatic cross-reference inside C namespace

2020-11-16 Thread Mauro Carvalho Chehab
Em Tue, 17 Nov 2020 02:12:01 +
Nícolas F. R. A. Prado  escreveu:

> Sphinx 3.1 introduced namespaces for C cross-references. With this,
> each C domain type/function declaration is put inside the namespace that
> was active at the time of its declaration.
> 
> Add support for automatic cross-referencing inside C namespaces by
> checking whether the corresponding source file had a C namespace Sphinx
> directive, and if so, try cross-referencing inside of it before going to
> the global scope.
> 
> This assumes there's only one namespace (if any) per rst file.
> 
> Signed-off-by: Nícolas F. R. A. Prado 
> ---
> 
> To those following from v1:
> 
> I ended up doing the simplest solution possible, which is to just directly 
> read
> the rst source corresponding to the doc page right before doing the 
> automarkup.
> It's not very efficient in the sense that the source is being read
> twice (first by Sphinx, then by this), but it sidesteps the "data sharing
> between processes" issue, so parallel_read_safe can be reenabled, and I didn't
> notice any performance hit from this patch (as opposed to the big hit from 
> v1).
> Works with both Sphinx 2 and 3.
> 
> Changes in v2:
> - Get C namespace from reading the corresponding source at the time of doing
>   automarkup instead of storing all namespaces beforehand at the source-read
>   phase
> - Add get_c_namespace()
> - Remove save_c_namespace()
> - Reenabled Sphinx's parallel_read_safe
> 
> v1: 
> https://lore.kernel.org/linux-doc/20201013231218.2750109-6-nfrapr...@protonmail.com/
> 
>  Documentation/sphinx/automarkup.py | 122 ++---
>  1 file changed, 75 insertions(+), 47 deletions(-)
> 
> diff --git a/Documentation/sphinx/automarkup.py 
> b/Documentation/sphinx/automarkup.py
> index 3e81ebab26ed..953b24b6e2b4 100644
> --- a/Documentation/sphinx/automarkup.py
> +++ b/Documentation/sphinx/automarkup.py
> @@ -53,6 +53,8 @@ RE_typedef = re.compile(r'\b(typedef)\s+([a-zA-Z_]\w+)', 
> flags=ascii_p3)
>  #
>  RE_doc = re.compile(r'\bDocumentation(/[\w\-_/]+)(\.\w+)*')
>  
> +RE_namespace = re.compile(r'^\s*..\s*c:namespace::\s*(\S+)\s*$')
> +
>  #
>  # Reserved C words that we should skip when cross-referencing
>  #
> @@ -70,6 +72,8 @@ Skipfuncs = [ 'open', 'close', 'read', 'write', 'fcntl', 
> 'mmap',
>'select', 'poll', 'fork', 'execve', 'clone', 'ioctl',
>'socket' ]
>  
> +c_namespace = ''
> +
>  def markup_refs(docname, app, node):
>  t = node.astext()
>  done = 0
> @@ -128,30 +132,38 @@ def markup_func_ref_sphinx3(docname, app, match):
>  #
>  # Go through the dance of getting an xref out of the C domain
>  #
> -target = match.group(2)
> +base_target = match.group(2)
>  target_text = nodes.Text(match.group(0))
>  xref = None
> -if not (target in Skipfuncs or target in Skipnames):
> -for class_s, reftype_s in zip(class_str, reftype_str):
> -lit_text = nodes.literal(classes=['xref', 'c', class_s])
> -lit_text += target_text
> -pxref = addnodes.pending_xref('', refdomain = 'c',
> -  reftype = reftype_s,
> -  reftarget = target, modname = None,
> -  classname = None)
> -#
> -# XXX The Latex builder will throw NoUri exceptions here,
> -# work around that by ignoring them.
> -#
> -try:
> -xref = cdom.resolve_xref(app.env, docname, app.builder,
> - reftype_s, target, pxref,
> - lit_text)
> -except NoUri:
> -xref = None
> +possible_targets = [base_target]
> +# Check if this document has a namespace, and if so, try
> +# cross-referencing inside it first.
> +if c_namespace:
> +possible_targets.insert(0, c_namespace + "." + base_target)
>  
> -if xref:
> -return xref
> +if base_target not in Skipnames:
> +for target in possible_targets:
> +if target not in Skipfuncs:

Hmm... do we still need to skip syscalls?

> +for class_s, reftype_s in zip(class_str, reftype_str):
> +lit_text = nodes.literal(classes=['xref', 'c', class_s])
> +lit_text += target_text
> +pxref = addnodes.pending_xref('', refdomain = 'c',
> +  reftype = reftype_s,
> +  reftarget = target, 
> modname = None,
> +  classname = None)
> +#
> +# XXX The Latex builder will throw NoUri exceptions here,
> +# work around that by ignoring them.
> +#
> +try:
> +

[PATCH v2] docs: automarkup.py: Allow automatic cross-reference inside C namespace

2020-11-16 Thread Nícolas F . R . A . Prado
Sphinx 3.1 introduced namespaces for C cross-references. With this,
each C domain type/function declaration is put inside the namespace that
was active at the time of its declaration.

Add support for automatic cross-referencing inside C namespaces by
checking whether the corresponding source file had a C namespace Sphinx
directive, and if so, try cross-referencing inside of it before going to
the global scope.

This assumes there's only one namespace (if any) per rst file.

Signed-off-by: Nícolas F. R. A. Prado 
---

To those following from v1:

I ended up doing the simplest solution possible, which is to just directly read
the rst source corresponding to the doc page right before doing the automarkup.
It's not very efficient in the sense that the source is being read
twice (first by Sphinx, then by this), but it sidesteps the "data sharing
between processes" issue, so parallel_read_safe can be reenabled, and I didn't
notice any performance hit from this patch (as opposed to the big hit from v1).
Works with both Sphinx 2 and 3.

Changes in v2:
- Get C namespace from reading the corresponding source at the time of doing
  automarkup instead of storing all namespaces beforehand at the source-read
  phase
- Add get_c_namespace()
- Remove save_c_namespace()
- Reenabled Sphinx's parallel_read_safe

v1: 
https://lore.kernel.org/linux-doc/20201013231218.2750109-6-nfrapr...@protonmail.com/

 Documentation/sphinx/automarkup.py | 122 ++---
 1 file changed, 75 insertions(+), 47 deletions(-)

diff --git a/Documentation/sphinx/automarkup.py 
b/Documentation/sphinx/automarkup.py
index 3e81ebab26ed..953b24b6e2b4 100644
--- a/Documentation/sphinx/automarkup.py
+++ b/Documentation/sphinx/automarkup.py
@@ -53,6 +53,8 @@ RE_typedef = re.compile(r'\b(typedef)\s+([a-zA-Z_]\w+)', 
flags=ascii_p3)
 #
 RE_doc = re.compile(r'\bDocumentation(/[\w\-_/]+)(\.\w+)*')
 
+RE_namespace = re.compile(r'^\s*..\s*c:namespace::\s*(\S+)\s*$')
+
 #
 # Reserved C words that we should skip when cross-referencing
 #
@@ -70,6 +72,8 @@ Skipfuncs = [ 'open', 'close', 'read', 'write', 'fcntl', 
'mmap',
   'select', 'poll', 'fork', 'execve', 'clone', 'ioctl',
   'socket' ]
 
+c_namespace = ''
+
 def markup_refs(docname, app, node):
 t = node.astext()
 done = 0
@@ -128,30 +132,38 @@ def markup_func_ref_sphinx3(docname, app, match):
 #
 # Go through the dance of getting an xref out of the C domain
 #
-target = match.group(2)
+base_target = match.group(2)
 target_text = nodes.Text(match.group(0))
 xref = None
-if not (target in Skipfuncs or target in Skipnames):
-for class_s, reftype_s in zip(class_str, reftype_str):
-lit_text = nodes.literal(classes=['xref', 'c', class_s])
-lit_text += target_text
-pxref = addnodes.pending_xref('', refdomain = 'c',
-  reftype = reftype_s,
-  reftarget = target, modname = None,
-  classname = None)
-#
-# XXX The Latex builder will throw NoUri exceptions here,
-# work around that by ignoring them.
-#
-try:
-xref = cdom.resolve_xref(app.env, docname, app.builder,
- reftype_s, target, pxref,
- lit_text)
-except NoUri:
-xref = None
+possible_targets = [base_target]
+# Check if this document has a namespace, and if so, try
+# cross-referencing inside it first.
+if c_namespace:
+possible_targets.insert(0, c_namespace + "." + base_target)
 
-if xref:
-return xref
+if base_target not in Skipnames:
+for target in possible_targets:
+if target not in Skipfuncs:
+for class_s, reftype_s in zip(class_str, reftype_str):
+lit_text = nodes.literal(classes=['xref', 'c', class_s])
+lit_text += target_text
+pxref = addnodes.pending_xref('', refdomain = 'c',
+  reftype = reftype_s,
+  reftarget = target, modname 
= None,
+  classname = None)
+#
+# XXX The Latex builder will throw NoUri exceptions here,
+# work around that by ignoring them.
+#
+try:
+xref = cdom.resolve_xref(app.env, docname, app.builder,
+ reftype_s, target, pxref,
+ lit_text)
+except NoUri:
+xref = None
+
+if xref:
+return xref
 
 return