Incorrect results/errors for partitioned `over()` with nulls

## Expected results
Same as `polars`:

```py
import polars as pl
data = {"a": [1, 1, None, 2, 2], "b": [1, 3, 3, 2, 3], "i": [0, 1, 2, 3, 4]}

b = pl.col("b")
df = pl.DataFrame(data)

df.select(
    "i",
    b_min=b.min().over("a"),
    b_mean=b.mean().over("a"),
    b_first=b.first().over("a"),
    b_last=b.last().over("a"),
).sort("i").drop("i")
```

```
shape: (5, 4)
┌───────┬────────┬─────────┬────────┐
│ b_min ┆ b_mean ┆ b_first ┆ b_last │
│ ---   ┆ ---    ┆ ---     ┆ ---    │
│ i64   ┆ f64    ┆ i64     ┆ i64    │
╞═══════╪════════╪═════════╪════════╡
│ 1     ┆ 2.0    ┆ 1       ┆ 3      │
│ 1     ┆ 2.0    ┆ 1       ┆ 3      │
│ 3     ┆ 3.0    ┆ 3       ┆ 3      │
│ 2     ┆ 2.5    ┆ 2       ┆ 3      │
│ 2     ┆ 2.5    ┆ 2       ┆ 3      │
└───────┴────────┴─────────┴────────┘
```


## Repro
While working on #3295, I discovered that when we join the aggregated results back, a `None` in the `partition_by` key(s) causes trouble:

```py
import narwhals as nw

# Reuses `data` from the polars example above.
b = nw.col("b")
df = nw.from_dict(data, backend="pyarrow")
df.select(
    "i",
    b_min=b.min().over("a"),
    b_mean=b.mean().over("a"),
    b_first=b.first().over("a"),
    b_last=b.last().over("a"),
).sort("i").drop("i").to_polars()
```

<details><summary>Show output</summary>


```
shape: (5, 4)
┌───────┬────────┬─────────┬────────┐
│ b_min ┆ b_mean ┆ b_first ┆ b_last │
│ ---   ┆ ---    ┆ ---     ┆ ---    │
│ i64   ┆ f64    ┆ i64     ┆ i64    │
╞═══════╪════════╪═════════╪════════╡
│ 1     ┆ 2.0    ┆ 1       ┆ 3      │
│ 1     ┆ 2.0    ┆ 1       ┆ 3      │
│ 2     ┆ 2.5    ┆ 2       ┆ 3      │
│ 2     ┆ 2.5    ┆ 2       ┆ 3      │
│ null  ┆ null   ┆ null    ┆ null   │
└───────┴────────┴─────────┴────────┘
```


</details> 



Our `pandas` impl raises on the same query:

```py
df = nw.from_dict(data, backend="pandas")
df.select(
    "i",
    b_min=b.min().over("a"),
    b_mean=b.mean().over("a"),
    b_first=b.first().over("a"),
    b_last=b.last().over("a"),
).sort("i").drop("i")
```

```
ShapeError: Expected object of length 5, got length: 4
```

The error can be avoided by removing `first` and `last`, but the remaining aggregations still return `null` for the row whose key is null:

```py
df = nw.from_dict(data, backend="pandas")
df.select("i", b_min=b.min().over("a"), b_mean=b.mean().over("a")).sort("i").drop(
    "i"
).to_polars()
```

<details><summary>Show output</summary>



```
shape: (5, 2)
┌───────┬────────┐
│ b_min ┆ b_mean │
│ ---   ┆ ---    │
│ f64   ┆ f64    │
╞═══════╪════════╡
│ 1.0   ┆ 2.0    │
│ 1.0   ┆ 2.0    │
│ null  ┆ null   │
│ 2.0   ┆ 2.5    │
│ 2.0   ┆ 2.5    │
└───────┴────────┘
```


</details> 



We *can* get the correct result for `min` and `mean` with `duckdb`:

```py
df = nw.from_dict(data, backend="polars")
df.lazy("duckdb").select("i", b_min=b.min().over("a"), b_mean=b.mean().over("a")).sort(
    "i"
).drop("i").collect("polars").to_polars()
```

<details><summary>Show output</summary>



```
shape: (5, 2)
┌───────┬────────┐
│ b_min ┆ b_mean │
│ ---   ┆ ---    │
│ i64   ┆ f64    │
╞═══════╪════════╡
│ 1     ┆ 2.0    │
│ 1     ┆ 2.0    │
│ 3     ┆ 3.0    │
│ 2     ┆ 2.5    │
│ 2     ┆ 2.5    │
└───────┴────────┘
```



</details> 

And by adding an `order_by` to `first` and `last`, we can get those two right as well:

```py
df = nw.from_dict(data, backend="polars")
df.lazy("duckdb").select(
    "i",
    b_min=b.min().over("a"),
    b_mean=b.mean().over("a"),
    b_first=b.first().over("a", order_by="i"),
    b_last=b.last().over("a", order_by="i"),
).sort("i").drop("i").collect("polars").to_polars()
```

<details><summary>Show output</summary>


```
shape: (5, 4)
┌───────┬────────┬─────────┬────────┐
│ b_min ┆ b_mean ┆ b_first ┆ b_last │
│ --- ┆ --- ┆ --- ┆ --- │
│ i64 ┆ f64 ┆ i64 ┆ i64 │
╞═══════╪════════╪═════════╪════════╡
│ 1 ┆ 2.0 ┆ 1 ┆ 3 │
│ 1 ┆ 2.0 ┆ 1 ┆ 3 │
│ 3 ┆ 3.0 ┆ 3 ┆ 3 │
│ 2 ┆ 2.5 ┆ 2 ┆ 3 │
│ 2 ┆ 2.5 ┆ 2 ┆ 3 │
└───────┴────────┴─────────┴────────┘
```



</details>
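
For intuition, here is a minimal pure-Python sketch of how a "group, aggregate, join back" strategy can lose the null-key row. It is not the narwhals implementation: `grouped_min`, `join_back` and the `nulls_equal` flag are hypothetical names, and it only assumes that the join compares keys SQL-style, so a null key never matches anything.

```python
from collections import defaultdict

data = {"a": [1, 1, None, 2, 2], "b": [1, 3, 3, 2, 3]}


def grouped_min(keys, values):
    """Aggregate step: min of `values` per distinct key (None forms its own group)."""
    groups = defaultdict(list)
    for k, v in zip(keys, values):
        groups[k].append(v)
    return {k: min(vs) for k, vs in groups.items()}


def join_back(keys, agg, nulls_equal):
    """Broadcast step: look up each row's key in the aggregated table.

    With nulls_equal=False a None key never matches (SQL-style `=`),
    so those rows come back as None even though the group was aggregated.
    """
    out = []
    for k in keys:
        if k is None and not nulls_equal:
            out.append(None)
        else:
            out.append(agg.get(k))
    return out


agg = grouped_min(data["a"], data["b"])
null_unsafe = join_back(data["a"], agg, nulls_equal=False)
null_safe = join_back(data["a"], agg, nulls_equal=True)

print(null_unsafe)  # [1, 1, None, 2, 2]
print(null_safe)    # [1, 1, 3, 2, 2]
```

The null-unsafe output has the same shape as the incorrect `b_min` values above, while the null-safe variant reproduces the `b_min` column from the expected results.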

Incorrect results/errors for partitioned `over()` with nulls #3300
