-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Experiment with some clustering algorithms on top of the similarity metric #8
Comments
I want to work on this. Could you please provide a description explaining this issue a little. Thanks! |
We have a way to evaluate the distance between two crash traces (WMD, but in the future we might experiment with other distance metrics #9), we can use this distance to cluster the stack traces in groups. We should test with some clustering algorithms (http://scikit-learn.org/stable/modules/clustering.html) and see how they perform. If the implemented algorithm turns out to be too slow (it's possible, as WMD is really slow), we can try two things:
But let's not worry about this slowness problem for now. Let's try with the WMD distance and a well-known algorithm first. |
With WMD, we can only use those clustering algorithms in which metric used is distance between points or others (eg-: Euclidean distance ) can also be used? |
Can you rephrase this question? Euclidean distance is a distance between points too. |
How exactly should we compare the various algorthims? |
A possible approach is #39. |
No description provided.
The text was updated successfully, but these errors were encountered: