Variance adjusted UniFrac #20

wasade · 2017-08-17T00:02:42Z

Dependent on #18 and #19.

Tests are not yet written for VA unweighted, weighted unnormalized, and generalized UniFrac. The expected values need to be worked out by hand, as there isn't a reference implementation.

ElDeveloper · 2017-08-18T05:48:40Z

Once more, could you update this branch?

wasade · 2017-08-19T16:09:00Z

@ElDeveloper all green

ElDeveloper

Looks great! I have not tested this locally as I am not familiar at all with variance adjusted UniFrac, so I don't really know what to expect.

ElDeveloper · 2017-08-21T07:45:11Z

sucpp/test_su.cpp

@@ -743,6 +743,48 @@ void test_generalized_unifrac() {
    SUITE_END();
 }

+void test_vaw_unifrac_weighted_normalized() {


Minor stylistic note, I checked in vim and this test case has a mix of tabs and spaces. Up to you if you want to fix. 👍

ElDeveloper · 2017-08-21T07:46:34Z

sucpp/unifrac.cpp

@@ -322,6 +321,56 @@ void progressbar(float progress) {
    std::cout.flush();
 }

+void initialize_embedded(double*& prop, const su::task_parameters* task_p) {
+	posix_memalign((void **)&prop, 32, sizeof(double) * task_p->n_samples * 2);


This line also has a tab instead of spaces ¯\_(ツ)_/¯

ElDeveloper · 2017-08-21T07:52:41Z

sucpp/unifrac.cpp

+                        Method unifrac_method, 
+                        const su::task_parameters* task_p) {
+    for(unsigned int i = task_p->start; i < task_p->stop; i++){
+        posix_memalign((void **)&dm_stripes[i], 32, sizeof(double) * task_p->n_samples);


Just in case, the interwebs seem to indicate that posix_memalign has a return value specific for out of memory, ENOMEM.

Updating all instances
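A minimal sketch of what checking the return value could look like. The helper name `alloc_aligned_doubles` is hypothetical and not part of the PR; the only library facts relied on are that `posix_memalign` returns 0 on success and an error number (`EINVAL` or `ENOMEM`) on failure, without setting `errno`.

```cpp
#include <stdlib.h>   // posix_memalign, free
#include <cstddef>
#include <cstdio>
#include <cstring>

// Hypothetical helper (not from the PR): allocate n doubles on a 32-byte
// boundary. Returns nullptr and reports the reason if the allocation fails.
double* alloc_aligned_doubles(std::size_t n) {
    void* buf = nullptr;
    // The return value is the error code itself; errno is not set.
    int err = posix_memalign(&buf, 32, sizeof(double) * n);
    if (err != 0) {
        std::fprintf(stderr, "posix_memalign failed: %s\n", std::strerror(err));
        return nullptr;
    }
    return static_cast<double*>(buf);
}
```

A caller would test the result for `nullptr` before use and release the buffer with `free`.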

ElDeveloper · 2017-08-21T07:55:17Z

sucpp/unifrac.cpp

+            break;
+        case generalized:
+            func = &su::_vaw_generalized_unifrac_task;
+            break;


Worth adding a default case where this returns an error and the program terminates?

Added. I'm not too concerned, as the switch is on an enum, so this can only arise if the switch receives a value that is not defined in su::Method, which I believe is a compile-time constraint. But it is defensive, and it could also arise from programmer error.
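For illustration, a defensive default case might look roughly like the sketch below. The `Method` enum here is a stand-in for `su::Method`, and the task functions, `task_fn`, and `select_task` are hypothetical names. Throwing is only one option; printing an error and exiting would serve the same purpose. Note that a value cast from an integer can reach the default branch even though every named enumerator is handled.

```cpp
#include <stdexcept>

// Stand-in for su::Method. The fixed underlying type is an assumption made
// so that any int value is representable in this sketch.
enum Method : int {
    unweighted,
    weighted_normalized,
    weighted_unnormalized,
    generalized
};

// Hypothetical task signature and placeholder tasks.
typedef int (*task_fn)();
int task_unweighted() { return 0; }
int task_weighted_normalized() { return 1; }
int task_weighted_unnormalized() { return 2; }
int task_generalized() { return 3; }

// Pick the task for a method; reject anything that is not a known method.
task_fn select_task(Method m) {
    switch (m) {
        case unweighted:
            return &task_unweighted;
        case weighted_normalized:
            return &task_weighted_normalized;
        case weighted_unnormalized:
            return &task_weighted_unnormalized;
        case generalized:
            return &task_generalized;
        default:
            // Reached only through a cast or a newly added, unhandled enumerator.
            throw std::invalid_argument("unknown UniFrac method");
    }
}
```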

ElDeveloper · 2017-08-21T08:02:39Z

sucpp/unifrac_task.cpp

+         * at the moment. basically, we can't assume the presence of avx2.
+         */
+        for(unsigned int j = 0; j < task_p->n_samples / 4; j++) {
+            int k = j * 4;


k is used here and in a loop below, perhaps worth a different letter ;)

Not sure I follow? k is instantiated here as it is the stride offset. Or is it because k is used in the cleanup loop as well? I also think it is appropriate to stick with k there, as the variable should not leak scope and its uses are consistent.

Fair enough, just thought it was confusing, I wasn't concerned with the leakage.
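The pattern under discussion, a loop that advances four elements per iteration followed by a cleanup loop for the remainder, can be sketched as follows. The function `accumulate_strided` and its arguments are hypothetical; the point is only that each `k` lives in its own scope and means "index into the arrays" in both places.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical example (not from the PR): add src into dst four elements
// at a time, then handle whatever is left over.
// Assumes dst holds at least as many elements as src.
void accumulate_strided(std::vector<double>& dst, const std::vector<double>& src) {
    const std::size_t n = src.size();

    for (std::size_t j = 0; j < n / 4; j++) {
        std::size_t k = j * 4;  // stride offset, scoped to this loop body
        dst[k]     += src[k];
        dst[k + 1] += src[k + 1];
        dst[k + 2] += src[k + 2];
        dst[k + 3] += src[k + 3];
    }

    // Cleanup loop: the last n % 4 elements. This k is a separate variable
    // with the same meaning, so nothing leaks between the two loops.
    for (std::size_t k = n - (n % 4); k < n; k++) {
        dst[k] += src[k];
    }
}
```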

ElDeveloper · 2017-08-21T08:04:55Z

sucpp/unifrac_task.cpp

+        dm_stripe_total = dm_stripes_total[stripe];
+
+        for(unsigned int j = 0; j < task_p->n_samples / 4; j++) {
+            int k = j * 4;


Same as above regarding k

ElDeveloper · 2017-08-21T08:08:28Z

sucpp/unifrac_task.cpp

+                double sum1 = (u1 + v1) / vaw;
+                double sub1 = fabs(u1 - v1) / vaw;
+                double sum_pow1 = pow(sum1, task_p->g_unifrac_alpha) * length;
+                dm_stripe[j] += sum_pow1 * (sub1 / sum1);


Could sum1 ever be zero i.e. u1 and v1 being zero?

The check is implicit; originally the zero check was done on sum1. This should never happen because, if sum1 is zero, then vaw must be zero as well.
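To make the reasoning concrete, here is a sketch of the per-branch update with an explicit guard. It assumes the surrounding code skips the update when vaw is zero, which the hunk above does not show; the helper name `vaw_generalized_term` and its parameter list are hypothetical.

```cpp
#include <cmath>

// Hypothetical helper (not from the PR): one branch's contribution for a
// pair of samples, mirroring the four lines in the hunk above.
//   u, v   : values for the two samples on this branch
//   vaw    : variance adjustment weight for the branch
//   alpha  : generalized UniFrac exponent
//   length : branch length
double vaw_generalized_term(double u, double v, double vaw,
                            double alpha, double length) {
    // Assumption taken from the reply above: vaw is zero whenever u + v is
    // zero, so guarding on vaw also keeps sub / sum from evaluating 0 / 0.
    if (vaw <= 0.0) {
        return 0.0;
    }
    double sum = (u + v) / vaw;
    double sub = std::fabs(u - v) / vaw;
    double sum_pow = std::pow(sum, alpha) * length;
    return sum_pow * (sub / sum);
}
```

If that invariant between sum and vaw did not hold, the division `sub / sum` would need its own check.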

ElDeveloper · 2017-08-21T08:16:56Z

sucpp/unifrac_task.hpp

+       unsigned int tid;            // thread ID
+       double g_unifrac_alpha;      // generalized unifrac alpha
+    };
+


This is going to need some documentation at some point ... 😟

ElDeveloper · 2017-08-21T18:19:43Z

sucpp/unifrac_task.cpp

+                double sum1 = (u1 + v1) / vaw;
+                double sub1 = fabs(u1 - v1) / vaw;
+                double sum_pow1 = pow(sum1, task_p->g_unifrac_alpha) * length;
+                dm_stripe[j] += sum_pow1 * (sub1 / sum1);


wasade added 21 commits August 15, 2017 16:24

Test for NULL on malloc...

79f8ab5

Test for file existence

96db81c

Cleaning up warning messages

b82cb31

ENH: generalized unifrac

fe30abf

Merge branch 'higher_stack_threading' into g_unifrac

7e8cd6a

Expose via q2

a99808c

TST: d^0 and d^0.5

23af923

Merge branch 'master' of github.com:wasade/q2-state-unifrac into g_unifrac

f007742

Missed import

b82b7e5

Refactoring

a98bf69

regression: task param boundaries were set wrong

2c083f2

Merge branch 'g_unifrac' into vaw

98ca65c

Refactor tasks to separate file

e3804bd

VAW methods

034cc4c

Citations

eb904a3

Expose VAW

a91850c

travis c++ needs another include

97232c6

syntax...

2b12261

wrong param name

0993406

Merge branch 'master' of github.com:wasade/q2-state-unifrac into vaw

9425e6a

Missing a merge conflict

be69604

wasade mentioned this pull request Aug 17, 2017

Metagenomic unifrac #22

Closed

Fix param

2d30c62

wasade added 4 commits August 18, 2017 15:41

Merge branch 'master' of github.com:wasade/q2-state-unifrac into vaw

7b5c105

Merge conflict error

b8e7997

Merge hell

aa72ae8

Not sure

a9df5fc

ElDeveloper approved these changes Aug 21, 2017

wasade added 2 commits August 21, 2017 08:26

Addressing @ElDeveloper's comments

619aeb9

Missed close of comment

6a4f1a6

ElDeveloper approved these changes Aug 21, 2017

ElDeveloper merged commit 0b4659d into master Aug 21, 2017
