Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Accelerate DBSCAN #48

Open
relleums opened this issue Nov 8, 2017 · 0 comments
Open

Accelerate DBSCAN #48

relleums opened this issue Nov 8, 2017 · 0 comments

Comments

@relleums
Copy link
Member

relleums commented Nov 8, 2017

DBSCAN is currently the limiting time factor when analysing photon-stream.

I reach about 50 events/s single thread DBSCAN clustering. using the plain sklearn implementation.
Reading from jsonl is about 300 events/s and reading from binary phs is about 15k events/s with the current python reader.

DBSCAN creates an octree structure every time it is called to reduce the computational expense of calculating all distances between all points (photons in the stream).
DBSCAN tries to populate a matrix of distances between points. Points which are obviously to far apart are not considered. Based on this matrix, the clustering is done.
However, in FACT we know the spatial relations of the photons in advance and might be able to accelerate the creation of the distance matrix

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant