On 03/23/2015 05:45 PM, Petr Kulhavy wrote:
> Hi Peter,
>
> I've just posted a patch to the community, you should have received another
> email from me just a few minutes ago.
> Basically there should be a kfree in terminate_all() just before echan->edesc
> is set to NULL.
> The reason is that at that point the edesc is in none of the vchan lists
> (edma_execute() removes it from the "issued" list)
> and vchan_get_all_descriptors() doesn't find it.
And this is the main issue in my view. The desc should be in one of the lists
IMHO.
> With the extra kfree the mem leak is not seen any more.
> It's a good question why terminate_all is called before the transfer finished,
> but I thought it could be called at any time?
>
> I'm also not sure if the echan->edesc shouldn't be free also in
> edma_execute(), could you please check that? :
>
> static void edma_execute(struct edma_chan *echan)
> {
> struct virt_dma_desc *vdesc;
> struct edma_desc *edesc;
> struct device *dev = echan->vchan.chan.device->dev;
> int i, j, left, nslots;
>
> /* If either we processed all psets or we're still not started */
> if (!echan->edesc ||
> echan->edesc->pset_nr == echan->edesc->processed) {
> /* Get next vdesc */
> vdesc = vchan_next_desc(&echan->vchan);
> if (!vdesc) {
> dev_dbg(dev, "Setting edesc 0x%p to NULL\n",echan->edesc);
> echan->edesc = NULL;
> #warning "possible memory leak here ?"
Yes, it would be, but in fact this will never happen since the edma_execute()
will not be called when echan->edesc is not NULL _and_ (echan->edesc->pset_nr
== echan->edesc->processed).
if we have echan->edesc, we are in a middle of transfer, moving from one batch
of sg to another batch.
I do have cleanup patches for this part to clarify the situation.
> return;
> }
> list_del(&vdesc->node); // at this point node was in issued
> echan->edesc = to_edma_desc(&vdesc->tx);
> }
>
> I'll post another patch for the wrong type in edma_alloc_chan_resources()
>
>
> Regards
> Petr
>
>
> On 23.03.2015 16:28, Peter Ujfalusi wrote:
>> On 03/20/2015 11:53 PM, Petr Kulhavy wrote:
>>> Hi Peter,
>>>
>>> yes, that is one of the differences to the EVM that the SD card is on
>>> MMCSD1.
>>> This is due to the pin multiplexer and other peripherals we're using.
>>>
>>> Your patch is correct, however the edma_callback() is using the channel
>>> number parameter for debug messages only. For the actual work echan->ch_num
>>> is
>>> used. BTW is that correct? Could there be a mismatch between the
>>> echan->ch_num
>>> and the ch_num parameter?
>>>
>>> Something else seems to be odd in edma_alloc_chan_resources():
>>>
>>> /* Alloc channel resources */
>>> static int edma_alloc_chan_resources(struct dma_chan *chan)
>>> {
>>> struct edma_chan *echan = to_edma_chan(chan);
>>> struct device *dev = chan->device->dev;
>>> int ret;
>>> int a_ch_num;
>>> LIST_HEAD(descs);
>>>
>>> a_ch_num = edma_alloc_channel(echan->ch_num, edma_callback,
>>> chan, EVENTQ_DEFAULT);
>> Hah, yes this is wrong but worked so far fine because:
>> struct edma_chan {
>> struct virt_dma_chan vchan;
>> ...
>> };
>>
>> struct virt_dma_chan {
>> struct dma_chan chan;
>> ...
>> };
>>
>> so &edma_chan == &edma_chan.vchan.chan;
>>
>> But this is not why you see the leak.
>>
>> What I did on my board is that I have swapped in SW the cc0 and cc1, now mmc0
>> is on cc1 from the sw point of view, but still can not reproduce the issue.
>>
>>> The third parameter to edma_alloc_channel() should be echan, not chan, since
>>> the edma_callback() interprets the callback data parameter as struct
>>> edma_chan *.
>>>
>>> Let me know if you find something or if you have an advice for more debug.
>> From the log I would go and see what happens in the
>> vchan_get_all_descriptors() and vchan_dma_desc_free_list(), is it so that at
>> terminate_all time the list we got is empty? But why it is?
>>
>> It seams you got the terminate_all call before the transfer finished, this is
>> also interesting. I'll look at at this more deeply.
>> What I see is that at terminate_all we clear the echan->edesc and for some
>> reason the vchan code will not free the desc. Later the completion interrupt
>> comes, but since the echan->edesc is NULL we do nothing. This causes the
>> leak.
>>
>> The question to all of this why and how to reproduce it?
>>
>
--
Péter
--
To unsubscribe from this list: send the line "unsubscribe linux-omap" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html