https://bugs.kde.org/show_bug.cgi?id=426003

--- Comment #7 from Thanh Trung Dinh <dinhthanhtrung1...@gmail.com> ---
Hi,

As @markd said, it may be useful for users who want to search for images
relating to 'sushi' or some specific objects, but in my opinion, the scope
of this project needs to be reviewed carefully. Since YOLO is designed for
object detection in general, there will be plenty of results for some
trivial objects such as: table, spoon, banana, etc. Moreover, I've seen
many cases where objects detected by YOLO are in the corners or not clearly
visible. So an image may end up tagged with sushi even though the sushi is
far away in the background of the picture.
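
As a rough illustration of that concern, a post-filter on the detector
output could look like the sketch below. It is only a sketch under stated
assumptions: the Detection type, the normalized box layout, the list of
"trivial" labels and all thresholds are hypothetical and do not come from
YOLO or digiKam.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Detection:
    """Hypothetical detector output; not a YOLO or digiKam type."""
    label: str
    confidence: float
    # Box in normalized image coordinates (0..1): top-left corner plus size.
    x: float
    y: float
    w: float
    h: float


# Assumed examples of labels that would flood the results.
TRIVIAL_LABELS = frozenset({"table", "spoon", "banana"})


def is_prominent(det: Detection,
                 min_confidence: float = 0.5,
                 min_area: float = 0.05,
                 max_center_offset: float = 0.35) -> bool:
    """Keep a detection only if it is non-trivial, confident, large enough
    and not pushed into a corner. All thresholds are illustrative guesses."""
    if det.label in TRIVIAL_LABELS:
        return False
    if det.confidence < min_confidence:
        return False
    # Fraction of the image covered by the box.
    if det.w * det.h < min_area:
        return False
    # Distance of the box center from the image center, per axis.
    cx = det.x + det.w / 2
    cy = det.y + det.h / 2
    return (abs(cx - 0.5) <= max_center_offset
            and abs(cy - 0.5) <= max_center_offset)


def tags_for_image(detections: Iterable[Detection]) -> List[str]:
    """Return the sorted, de-duplicated labels worth tagging."""
    return sorted({d.label for d in detections if is_prominent(d)})
```

With such a filter, a large, centered sushi box would be tagged, while a
spoon or a small box in a corner would be dropped. Whether these criteria
are the right ones is exactly what would need to be decided.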

Moreover, for specific objects (like sushi, plants, monuments, etc.) I
suppose we need a YOLO version trained on specific datasets for those
objects (or users may train the network themselves). Therefore, we really
need to define clearly the objects that we aim to include in digikam for
object detection.

So the project is really interesting, but I would propose running a poll
among digiKam users to get an idea of which objects we want detection to
support. Alternatively, a more extensible approach, which requires some
work from users, is to design code templates for object detection
(extending facesengine). Users would then only need to train the network
and provide the weights to run the detection on their own.
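
To give an idea of what such a template could look like, here is a minimal
sketch in Python. All names in it (ObjectDetector, register,
create_detector, DummySushiDetector) are hypothetical and are not part of
digiKam or facesengine; the dummy detector only stands in for a real
network that would load the user's weights.

```python
from abc import ABC, abstractmethod
from pathlib import Path
from typing import Callable, Dict, List, Tuple, Type


class ObjectDetector(ABC):
    """Hypothetical template: users subclass it and supply their weights."""

    def __init__(self, weights_path: str) -> None:
        # Path to the user-trained weights; only stored here, not loaded.
        self.weights_path = Path(weights_path)

    @abstractmethod
    def labels(self) -> List[str]:
        """Object classes this detector can report."""

    @abstractmethod
    def detect(self, image_path: str) -> List[Tuple[str, float]]:
        """Return (label, confidence) pairs found in the image."""


# Simple name -> class registry so detectors can be plugged in by name.
_REGISTRY: Dict[str, Type[ObjectDetector]] = {}


def register(name: str) -> Callable[[Type[ObjectDetector]], Type[ObjectDetector]]:
    def decorator(cls: Type[ObjectDetector]) -> Type[ObjectDetector]:
        _REGISTRY[name] = cls
        return cls
    return decorator


def create_detector(name: str, weights_path: str) -> ObjectDetector:
    return _REGISTRY[name](weights_path)


@register("dummy-sushi")
class DummySushiDetector(ObjectDetector):
    """Placeholder: a real subclass would run a trained network instead."""

    def labels(self) -> List[str]:
        return ["sushi"]

    def detect(self, image_path: str) -> List[Tuple[str, float]]:
        # Fake inference: looks at the file name only, for demonstration.
        if "sushi" in Path(image_path).name:
            return [("sushi", 1.0)]
        return []
```

The application would then only call create_detector() with a name and a
weights file, and would not need to know which objects the user trained
the network for.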

Best,
Trung

On Mon, Aug 31, 2020 at 12:43 AM markd <bugzilla_nore...@kde.org> wrote:

> https://bugs.kde.org/show_bug.cgi?id=426003
>
> --- Comment #6 from markd <citbparpmakajjecpg@kiabws.online> ---
> Hi Nghia,
>
> Very happy to hear that you work on that topic. Sounds so great, thanks for
> your hard work.
>
> As I am a digiKam user and an iPhone user, it would be great to have the
> following use case.
>
> 1. Each picture is run through the YOLO model and gets assigned 0, 1 or
> more objects.
>
> 2. If an object is wrong, the user can delete or update it (by update I
> mean choosing one of the many existing objects of the YOLO model). But it
> would be very annoying if I had to verify whether each predicted object is
> correct or not!!!
>
>
> With my iPhone the use case is the following.
>
> I take a picture of something, let's say sushi.
>
> Then several days later I want to see all the pictures of sushi I took.
> So I go to the search bar and type 'sushi', then I see all the pictures of
> sushi.
>
> Would be great to have this feature.
>
>
> In addition, it would also be great to have tags for each assigned object
> [like for people + manual tags], so there would be a category 'objects'
> with its subcategories 'tree', 'sushi', etc. Then I could simply click on
> sushi to see all pictures of sushi.
>
> --
> You are receiving this mail because:
> You are on the CC list for the bug.

