Re: [webkit-dev] Smart Pointer Analysis Tool for WebKit

Emilio Cobos Álvarez Thu, 17 Sep 2020 02:53:39 -0700

Interesting. This looks fairly similar to some of the checkers we use inmozilla-central, fwiw.

One interesting difference is that we opted for explicitly annotatingthe functions that can run script (think updateStyleIfNeeded(),dispatchEvent() etc equivalents) to be able to not warn for cases whereusing raw pointers is fine. See [1] for the current rules we're using.

So, I wonder... for a concrete example like [2], what is what wouldallow you to use shadowHost() without storing it on a local RefPtrotherwise, for example? Or is the plan to either pay the refcount churn,or silence the warnings on a per-case basis?


Cheers,

  -- Emilio

[1]:https://searchfox.org/mozilla-central/rev/f4b4008f5ee00f5afa3095f48c54f16828e4b22b/build/clang-plugin/CanRunScriptChecker.cpp#5-49[2]:https://webkit-search.igalia.com/webkit/rev/4c54a6d287d7fba30e1fb37d5afda692fb12a758/Source/WebCore/dom/Node.cpp#1041


On 9/17/20 8:32 AM, Ryosuke Niwa wrote:

Hi all,
I’ve been working with Geoff (ggaren) and Jan Korous from Apple'scompiler team to create a static analyzer which detects dangerous use ofref counted objects, and we’re looking for folks who are interested intrying out this tool and helping us refine the tool. Please let me knowif you’re interested in using this tool & try applying code changes toWebKit. Our goal is to eventually integrate this tool into buildbot andEWS so that we can catch dangerous code before they get committed.
*What is Dangerous?*
So *what is* a /dangerous/ use of ref counted objects you may ask? It’sany use of which we can’t trivially conclude that it doesn’t lead to ause-after-free.
For now, we don’t detect dangerous use of non-ref counted objectsincluding ones that can vend WeakPtr. It’s on us, humans, to decidewhich objects need to be ref counted or need to be CanMakeWeakPtr.
Consider the following example. This code most certainly will lead to ause-after-free of “parent” in the third line because the code doesn’tkeep the parent alive. Because isContentEditable updates style in somecases, it could lead to arbitrary script execution which could removethe parent from the document.
Node* parent = element->parentElement();
if (parent->isContentEditable())
     parent->scrollIntoView();
In general, relying on a complex data structure such as DOM tree to keepRefCounted objects alive while we call a non-trivial function is notsafe. All it takes for the code to have a use-after-free is for someoneto start updating style, layout, etc… inside the function eitherdirectly or indirectly. And we don’t want to make WebKit developmentreally hard by forcing anyone who modifies a function to check everycaller of the function and their callers, etc… to make sure it’s safe todo so.
For this reason, it’s /dangerous/ to store a raw pointer or a referenceto a ref counted object as a local variable and use it after calling anon-trivial function. We did a similar analysis of a number of otherpatterns and usage of ref counted objects in WebKit and came up with thefollowing basic rules for using ref counted objects in a safe manner.We’re hoping that these rules would be eventually incorporated into ourcoding style guideline: https://webkit.org/code-style-guidelines/<https://webkit.org/code-style-guidelines/>
*Rules for Using Ref Counted Objects*

 1. Every data member to a ref counted object must use either Ref,
    RefPtr, or WeakPtr. webkit.NoUncountedMemberChecker
    
<https://github.com/llvm/llvm-project/blob/master/clang/docs/analyzer/checkers.rst#id17>
 2. Every ref counted base class, if it has a derived class, must define
    a virtual destructor. webkit.RefCntblBaseVirtualDtor
    
<https://github.com/llvm/llvm-project/blob/master/clang/docs/analyzer/checkers.rst#id16>
 3. Every ref counted object passed to a non-trivial function as an
    argument (including "this" pointer) should be stored as a Ref or
    RefPtr in the caller’s local scope unless it's an argument to the
    caller itself by transitive property [1].
    alpha.webkit.UncountedCallArgsChecker
    
<https://github.com/llvm/llvm-project/blob/master/clang/docs/analyzer/checkers.rst#id21>
 4. Every ref counted object must be captured using Ref or RefPtr for
    lambda. webkit.UncountedLambdaCapturesChecker
    
<https://github.com/llvm/llvm-project/blob/master/clang/docs/analyzer/checkers.rst#id18>
 5. Local variables - we’re still working on this
    (https://reviews.llvm.org/D83259 <https://reviews.llvm.org/D83259>)
Below, I've dissected each one of these rules with the real warningemitted by the analysis tool in development. Please let me know if youhave any comments / concerns.
------------------------------------------------------------------------
(1) is pretty trivial. Every ref counted data member should be storedusing Ref, RefPtr, or WeakPtr since it would be not trivially obviousthat life cycles of two or more objects are correctly tied or managedtogether.
------------------------------------------------------------------------
(2) is also pretty easy to understand. In the following example, ifsomeone destroys an instance of B using Ref<A>, then it would result inan undefined behavior.
struct A : public RefCounted<A> {
         Vector<int> someData;
};

struct B : public A {
         Vector<int> otherData;
};

------------------------------------------------------------------------
For (3), passing a ref counted object as a raw pointer or reference to afunction as an argument, the tool may generate a warning like this:
Source/WebCore/html/FormAssociatedElement.cpp:181:13: warning: [WebKit]call argument is a raw pointers to a ref-countable type[-Wfunction-arg-ptr-to-refcntbl]
     setForm(findAssociatedForm(&asHTMLElement(), originalForm.get()));
             ^
This warns that void setForm(HTMLFormElement*) is called with the resultof findAssociatedForm, which returns HTMLFormElement* without storing itanywhere. If setForm can somehow cause HTMLFormElement to be deleted,then this can result in the use-after-free in setForm.
void FormAssociatedElement::resetFormOwner()
{
     RefPtr<HTMLFormElement> originalForm = m_form.get();
setForm(findAssociatedForm(&asHTMLElement(), originalForm.get()));
     HTMLElement& element = asHTMLElement();
     auto* newForm = m_form.get();
     if (newForm && newForm != originalForm && newForm->isConnected())
         element.document().didAssociateFormControl(element);
}
Why, you may ask, we don't put HTMLFormElement* in a Ref or RefPtr insetForm instead? This is because a lot of code has calls to a functionwith a local variable. Because the callee doesn’t know whether thecaller has already stored each argument in Ref / RefPtr or not, we needto be safe and store them again in Ref / RefPtr, resulting in anunnecessary ref-churn. Additionally, if we took the approach of thecallee being responsible for keeping every argument alive, then everynon-trivial member function of a ref counted object must have"protectedThis" at the beginning of the function, which would beparticularly wasteful if those member functions are sometimes called byother member functions.
Once this rule is applied everywhere, for example, we can get rid of“protectedThis” from our codebase because it would be redundant.Furthermore, this rule transitively allows a raw pointer or a referencepassed in as an argument to be used as arguments to call anotherfunction without storing it in Ref or RefPtr.
For example, the following function satisfies this rule because botholdDocument and newDocument are raw references passed to this functionas arguments, and therefore the caller of this function must havealready stored as Ref / RefPtr as local variables.
void HTMLMediaElement::didMoveToNewDocument(Document& oldDocument,Document& newDocument)
{
     ASSERT_WITH_SECURITY_IMPLICATION(&document() == &newDocument);
     if (m_shouldDelayLoadEvent) {
         oldDocument.decrementLoadEventDelayCount();
         newDocument.incrementLoadEventDelayCount();
     }

     unregisterWithDocument(oldDocument);
     registerWithDocument(newDocument);

     HTMLElement::didMoveToNewDocument(oldDocument, newDocument);
     updateShouldAutoplay();
}

------------------------------------------------------------------------
For (4), capturing a ref counted object as a raw pointer or a reference,the tool may generate a warning like this:
Source/WebKit/UIProcess/API/APIHTTPCookieStore.cpp:115:74: warning:[WebKit] captured a raw pointer to a ref-countable type[-Wlambda-capture-ptr-to-refcntbl] cookieManager->deleteCookie(m_owningDataStore->sessionID(), cookie,[pool = WTFMove(pool), completionHandler = WTFMove(completionHandler)]()mutable {
                                                                          ^
This is warning about webPage object getting captured using a rawreference in the following function:
void HTTPCookieStore::deleteCookie(const WebCore::Cookie& cookie,CompletionHandler<void()>&& completionHandler)
{
auto* pool =m_owningDataStore->processPoolForCookieStorageOperations();
     if (!pool) {
if (m_owningDataStore->sessionID() ==PAL::SessionID::defaultSessionID() && !cookie.session)
             deleteCookieFromDefaultUIProcessCookieStore(cookie);
         else
             m_owningDataStore->removePendingCookie(cookie);
RunLoop::main().dispatch([completionHandler =WTFMove(completionHandler)] () mutable {
             completionHandler();
         });
         return;
     }
auto* cookieManager =pool->supplement<WebKit::WebCookieManagerProxy>(); cookieManager->deleteCookie(m_owningDataStore->sessionID(), cookie,[pool = WTFMove(pool), completionHandler = WTFMove(completionHandler)]()mutable {
         completionHandler();
     });
}

Here, we should have stored WebProcessPool in Ref instead.

- R. Niwa

_______________________________________________
webkit-dev mailing list
[email protected]
https://lists.webkit.org/mailman/listinfo/webkit-dev

_______________________________________________
webkit-dev mailing list
[email protected]
https://lists.webkit.org/mailman/listinfo/webkit-dev

Re: [webkit-dev] Smart Pointer Analysis Tool for WebKit

Reply via email to