I decreased the sampling interval to reduce the base size from 40,000 to
10,000 so that I could then use the DBScan routine.

Now another problem has arisen: I want to analyze the "Noisy Samples"
points and I need to calculate the distance to the nearest cluster, ie (a)
the distance to the nearest point and know (b) which cluster this point
belongs to . I believe these data are available because the algorithm
calculates this distance, but only marks the point that has that distance
greater than the EPS as "Noisy Samples".

I believe the routine needs to be changed because it has only the output
attributes "core_sample_indices_", "components_" and "labels_" are
available.

Can you help me?

Att.,
Mauricio Reis

2018-05-18 19:29 GMT-03:00 Shane Grigsby <shane.grig...@colorado.edu>:

> Hi Mauricio,
> You can also use OPTICS in DBSCAN mode. The pull request is here if you'd
> like to clone it:
>
> https://github.com/scikit-learn/scikit-learn/pull/1984
>
> Running ~40,000 points in three dimensions takes about a minute. See the
> example page here for how to do the DBSCAN extraction:
>
> https://github.com/espg/scikit-learn/blob/2eac9fbf67b2715e11
> fdedfbb63bcdb56a80e216/examples/cluster/plot_optics.py
>
> Cheers,
> Shane
>
> On 05/17, Mauricio Reis wrote:
>
>> I'm not used to the terms used here. So I understood that the package had
>> memory management, which was removed. But you could make the code
>> available
>> with memory management implementations. Is it?! :-)
>> The problem is that I do not know what I would do with the code, because I
>> only know how to work with the SciKitLearn package ready. :-(
>>
>> Att.,
>> Mauricio Reis
>>
>> 2018-05-16 20:33 GMT-03:00 Joel Nothman <joel.noth...@gmail.com>:
>>
>> Implemented in a previous version of #10280
>>> <https://github.com/scikit-learn/scikit-learn/pull/10280>, but removed
>>> for now to simplify reviews
>>> <https://github.com/scikit-learn/scikit-learn/pull/10280#pul
>>> lrequestreview-95622713>.
>>> If others would like to review #10280, I'm happy to follow up with the
>>> changes requested here, which have already been implemented by Aman
>>> Dalmia
>>> and myself.​
>>>
>>> _______________________________________________
>>> scikit-learn mailing list
>>> scikit-learn@python.org
>>> https://mail.python.org/mailman/listinfo/scikit-learn
>>>
>>>
>>>
> _______________________________________________
>> scikit-learn mailing list
>> scikit-learn@python.org
>> https://mail.python.org/mailman/listinfo/scikit-learn
>>
>
>
> --
> *PhD candidate & Research Assistant*
> *Cooperative Institute for Research in Environmental Sciences (CIRES)*
> *University of Colorado at Boulder*
> _______________________________________________
> scikit-learn mailing list
> scikit-learn@python.org
> https://mail.python.org/mailman/listinfo/scikit-learn
>
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

Reply via email to