fix: minor fixes in some notebooks

BeaMarton13 · BeaMarton13 · commit 399c6fa60182 · 2025-08-29T17:00:30.000+03:00
diff --git a/doc/source/community_detection_guide/notebooks/community_detection_algorithms.ipynb b/doc/source/community_detection_guide/notebooks/community_detection_algorithms.ipynb
@@ -8,7 +8,7 @@
     "# __Community detection algorithms__\n",
     "## __Optimization based methods: modularity maximization__\n",
     "Modularity maximization methods are a prominent class of algorithms in community detection that aim to discover partitions of a network by optimizing a specific quality function called modularity.\n",
-    "<div style=\"background-color: #e6ffe6; padding: 20px; border-radius: 5px;\">\n",
+    "<div style=\"background-color: #e6ffe6; padding: 0px; border-radius: 5px;\">\n",
     "    \n",
     "**NOTE:** You can find a more detailed explanation of **modularity** [here](./modularity.ipynb).\n",
     "\n",
@@ -92,7 +92,7 @@
     "\n",
     "### __community_fluid_communities__\n",
     "\n",
-    "When is community_fluid_communities applied?\n",
+    "#### When is community_fluid_communities applied?\n",
     "\n",
     "The Fluid Communities algorithm is typically applied when:\n",
     "\n",
@@ -107,7 +107,7 @@
     "\n",
     "### __community_edge_betweenness (The Girvan-Newman Algorithm)__\n",
     "\n",
-    "When is community_edge_betweenness applied?\n",
+    "#### When is community_edge_betweenness applied?\n",
     "\n",
     "The Edge Betweenness (Girvan-Newman) algorithm is typically applied when:\n",
     "\n",
@@ -122,7 +122,7 @@
     "\n",
     "### __community_label_propagation__\n",
     "\n",
-    "When is community_label_propagation applied?\n",
+    "#### When is community_label_propagation applied?\n",
     "\n",
     "The Label Propagation Algorithm (LPA) is typically applied when:\n",
     "\n",
diff --git a/doc/source/community_detection_guide/notebooks/initial_workflow.ipynb b/doc/source/community_detection_guide/notebooks/initial_workflow.ipynb
@@ -552,11 +552,11 @@
     "\n",
     "* **Note on local seeds:** Many `igraph` algorithms, such as the Leiden algorithm, rely on random processes. To ensure a specific part of your analysis is reproducible without affecting the rest of your notebook, you can use a custom utility function, such as `local_random()`, imported from [here](./functions.ipynb).\n",
     "\n",
-    "    For example:\n",
-    "    ```python\n",
-    "    with local_random(seed=123):\n",
-    "        g.community_leiden()\n",
-    "    ```\n",
+    "For example:\n",
+    "```python\n",
+    "with local_random(seed=123):\n",
+    "    g.community_leiden()\n",
+    "```\n",
     "\n",
     "</div>"
    ]
diff --git a/doc/source/community_detection_guide/notebooks/membership_vector.ipynb b/doc/source/community_detection_guide/notebooks/membership_vector.ipynb
@@ -7,15 +7,6 @@
    "source": [
     "# Membership vector\n",
     "\n",
-    "### What is a __membership vector__?\n",
-    "A membership vector is a list or array that assigns a cluster or group identifier to each data point or object.\n",
-    "\n",
-    "* __Structure:__ It's a one-dimensional sequence where the length is equal to the number of data points.\n",
-    "\n",
-    "* __Indexing:__ Each index in the vector corresponds to a specific data point. For example, the value at index i is the cluster ID for the i-th data point.\n",
-    "\n",
-    "* __Values:__ The values in the vector are the cluster IDs. These are typically non-negative integers (e.g., 0, 1, 2, 3, ...).\n",
-    "\n",
     "### Karate club network: A case study in `igraph`\n",
     "Let's apply the concept of a membership vector to the famous Zachary's Karate Club network. \n",
     "\n",
@@ -87,7 +78,7 @@
    "id": "0160c803-fa2a-4dd4-802c-8ce94df9051e",
    "metadata": {},
    "source": [
-    "### Why it's useful\n",
+    "### Why is it useful?\n",
     "\n",
     "The membership vector is the most direct and compact representation of a clustering. It serves as the basis for almost all subsequent analyses and visualizations:\n",
     "\n",
diff --git a/doc/source/community_detection_guide/notebooks/modularity.ipynb b/doc/source/community_detection_guide/notebooks/modularity.ipynb
@@ -5,7 +5,8 @@
    "id": "94f8164e-537f-41a1-bfc4-ed15c7b00cf8",
    "metadata": {},
    "source": [
-    "# Modularity formula\n",
+    "# Modularity\n",
+    "## Modularity formula\n",
     "\n",
     "Modularity is a quantitative metric used to evaluate the strength of a network's division into modules (or communities). It measures how well the network is partitioned by comparing the density of edges within communities to the expected density of such edges in a randomized network that preserves the original degree distribution. The formula for modularity is given below.\n",
     "\n",
@@ -36,7 +37,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 2,
+   "execution_count": 1,
    "id": "a4766cc0-3157-493f-a072-9fd87ed92519",
    "metadata": {},
    "outputs": [
@@ -100,7 +101,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": 2,
    "id": "eccd7d47-1606-4fc8-ac0c-31ad49604f0b",
    "metadata": {},
    "outputs": [
@@ -127,7 +128,7 @@
    "metadata": {},
    "source": [
     "### Modularity calculation\n",
-    "### Computation for \"good\" partitioning ($P_{good}$)\n",
+    "### Computation for \"good\" partitioning ($P_\\text{good}$)\n",
     "\n",
     "This partition correctly identifies the two cliques.\n",
     "\n",
@@ -154,9 +155,38 @@
     "\n",
     "**Final modularity** ($Q_{good}$):\n",
     "$Q_{good} = \\frac{1}{2m} \\times (\\text{Total Sum}) = \\frac{1}{14} \\times 5 = \\frac{5}{14} \\approx \\mathbf{0.357}$\n",
+    "\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "id": "b6de50ca-146b-4323-9828-80d883d2d8e9",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "0.3571428571428571"
+      ]
+     },
+     "execution_count": 3,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "membership_good = [0, 0, 0, 1, 1, 1]\n",
+    "g.modularity(membership_good)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "8bdbc1dc-5c26-4b33-b5f4-072fee61fc7f",
+   "metadata": {},
+   "source": [
     "\n",
-    "\n",
-    "### 2. Computation for \"bad\" partitioning ($P_{bad}$)\n",
+    "### Computation for \"bad\" partitioning ($P_\\text{bad}$)\n",
     "\n",
     "This partition incorrectly splits a clique and merges nodes from both communities.\n",
     "\n",
@@ -186,14 +216,36 @@
     "$Q_{bad} = \\frac{1}{2m} \\times (\\text{Total Sum}) = \\frac{1}{14} \\times (-3) = -\\frac{3}{14} \\approx \\mathbf{-0.214}$"
    ]
   },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "id": "894ddb6b-25d5-4060-be09-5758f2d3db45",
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "-0.2142857142857143"
+      ]
+     },
+     "execution_count": 4,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "membership_bad = [0, 0, 1, 0, 1, 1]\n",
+    "g.modularity(membership_bad)"
+   ]
+  },
   {
    "cell_type": "markdown",
    "id": "f3981bc5-f6bf-4b41-a168-cae9c17ec764",
    "metadata": {},
    "source": [
     "*Note:* Based on our previous analysis, the \"good\" partitioning yields a significantly higher modularity score. It is important to note, however, that a high modularity score is not always a definitive indicator of a better community partitioning, as was previously demonstrated with the Grid Graph [here](test_significance_of_community.ipynb).\n",
     "\n",
-    "# Directed Modularity\n",
+    "## Directed modularity\n",
     "\n",
     "While the classic modularity formula works for undirected networks, a different approach is needed for **directed networks**, where edges have a specific direction (e.g., from node *i* to node *j*). In this context, the direction of an edge is crucial and should not be ignored.\n",
     "\n",
@@ -213,7 +265,7 @@
     "* **Flipping a single edge:** Reversing a single edge (e.g., from A → B to B → A) will change the modularity score. This is because the out-degree of A and the in-degree of B would change, altering the null model's calculation and, consequently, the overall score.\n",
     "* **Flipping all edges:** If you reverse the direction of **every single edge** in the network, the modularity score will **remain the same**. This is due to a symmetry property of the formula. The set of in-degrees becomes the new set of out-degrees, and vice versa. When the formula is applied to this completely reversed network, the total modularity score is unchanged. This is a fascinating property of directed modularity.\n",
     "\n",
-    "# From directed to undirected formula\n",
+    "## From directed to undirected formula\n",
     "Start with the directed formula:\n",
     "\n",
     "$$Q = \\frac{1}{m} \\sum_{i,j} \\left[ A_{ij} - \\gamma \\frac{k_i^\\text{out} k_j^\\text{in}}{m} \\right] \\delta(c_i, c_j)$$\n",
@@ -232,7 +284,7 @@
     "\n",
     "\n",
     "\n",
-    "# Why the resolution parameter is important\n",
+    "## Why the resolution parameter is important\n",
     "\n",
     "The resolution parameter addresses a fundamental limitation of the original modularity measure, known as the **\"resolution limit\"**. This is the tendency of the original formula (where $\\gamma=1$) to fail at detecting small communities, especially in large graphs. It often merges smaller, distinct communities into a single larger one to maximize the modularity score.\n",
     "\n",
@@ -242,7 +294,8 @@
     "* $\\gamma < 1$: Decreasing the resolution parameter reduces the penalty. This allows the algorithm to find **more and smaller communities**, as it becomes easier for closely-knit groups to be identified as their own communities.\n",
     "\n",
     "In essence, the resolution parameter provides a flexible way to explore the community structure of a network at different scales, moving beyond the limitations of a single, fixed-scale partition.\n",
-    "# Density-based modularity for undirected graphs\n",
+    "\n",
+    "## Density-based modularity for undirected graphs\n",
     "\n",
     "While modularity is a powerful metric, it suffers from a well-known flaw called the **resolution limit**. This problem causes the modularity-maximizing algorithm to fail to detect small, tightly-knit communities, especially in large networks. Instead of finding these small groups, it often merges them into a single larger one to maximize the modularity score.\n",
     "\n",
@@ -270,14 +323,6 @@
     "\n",
     "In this formulation, the null model assumes **uniform edge probability**, so communities are favored if their **internal density** is higher than the global density.\n"
    ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "id": "1b9c3fa0-05b7-461f-acf8-8348f344a2d9",
-   "metadata": {},
-   "outputs": [],
-   "source": []
   }
  ],
  "metadata": {
diff --git a/doc/source/community_detection_guide/notebooks/resolution.ipynb b/doc/source/community_detection_guide/notebooks/resolution.ipynb
diff --git a/doc/source/community_detection_guide/notebooks/test_significance_of_community.ipynb b/doc/source/community_detection_guide/notebooks/test_significance_of_community.ipynb
@@ -270,7 +270,7 @@
    "id": "94db88ee-12f7-4e68-a2ab-d76136a6d983",
    "metadata": {},
    "source": [
-    "## Testing Significance of Community Structure on a Grid Graph"
+    "## Testing significance of community structure on a grid graph"
    ]
   },
   {
@@ -427,14 +427,6 @@
     "\n",
     "plot_nmi_histogram(er_graph, pairwise_nmi_values, title)"
    ]
-  },
-  {
-   "cell_type": "code",
-   "execution_count": null,
-   "id": "54dc19c6-8e76-491d-bc20-b76b08b128bd",
-   "metadata": {},
-   "outputs": [],
-   "source": []
   }
  ],
  "metadata": {
