
Best coverage subsets for three varying numbers of datasets #18

@LinguList

Description


If we follow the plan to offer three different networks, namely a high-coverage one with many languages and, say, 300 concepts, one with fewer languages but more concepts (say, 600), and one with the maximum we can get, we need to use the coverage code in lingpy to account for this (see the sketch below).
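
For the record, a minimal sketch of how such a coverage-based subset could be selected with the helpers in `lingpy.compare.sanity`. The input file name is a placeholder, and the unpacking of the `mutual_coverage_subset` result follows the documented examples rather than being verified here:

```python
from lingpy import Wordlist
from lingpy.compare.sanity import mutual_coverage_check, mutual_coverage_subset

# placeholder file: any wordlist with "doculect" and "concept" columns
wl = Wordlist('clics-wordlist.tsv')

# find the largest threshold at which every pair of languages
# still shares at least that many concepts
for threshold in range(300, 0, -1):
    if mutual_coverage_check(wl, threshold):
        print('minimal mutual coverage: {0} concepts'.format(threshold))
        break

# search for concept subsets that guarantee a mutual coverage of 300
# (return structure assumed from the lingpy documentation)
count, results = mutual_coverage_subset(wl, 300)
coverage, concepts = results[0]
print('best subset: {0} concepts at coverage {1}'.format(count, coverage))
```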

This code is straightforward by now, but the question is: do we actually still need it, or do we rather just take the full dump of 2000 concepts? Given that we know the frequency of each concept in CLICS, we can easily visualize this by scaling node size by frequency, as sketched below. And the communities still make sense; so far, we do not suffer from skewed data...
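
Just to illustrate the node-size idea, a quick sketch with networkx/matplotlib, using a made-up frequency dict standing in for the real CLICS counts:

```python
import networkx as nx
import matplotlib.pyplot as plt

# toy colexification graph; real edges would come from the CLICS dump
G = nx.Graph()
G.add_edges_from([('TREE', 'WOOD'), ('WOOD', 'FOREST'), ('TREE', 'FOREST'),
                  ('SKIN', 'BARK'), ('BARK', 'TREE')])

# made-up concept frequencies (e.g. number of datasets attesting each concept)
frequency = {'TREE': 20, 'WOOD': 15, 'FOREST': 8, 'SKIN': 18, 'BARK': 5}

# scale node size by frequency so rarely attested concepts stand out as small
sizes = [frequency[node] * 50 for node in G.nodes()]
pos = nx.spring_layout(G, seed=42)
nx.draw_networkx(G, pos, node_size=sizes, with_labels=True)
plt.axis('off')
plt.show()
```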
