Request for supporting conv1d and pooling ops with explicit sharding. #28090

Closed
vdutor opened this issue Apr 17, 2025 · 0 comments · Fixed by #28253
Labels: enhancement (New feature or request)

vdutor commented Apr 17, 2025

Dear JAX Team,

We are working on a project that relies heavily on 1D convolutional and pooling layers over sharded inputs. We would like to migrate to the new explicit sharding API, but found that jax.lax.conv_general_dilated and jax.lax.reduce_window break for our use case. The specific operations, together with a minimal failing example, are below:

import jax
jax.config.update('jax_num_cpu_devices', 8)  # Run with 8 devices.
import numpy as np
from jax.experimental.shard import reshard


mesh = jax.make_mesh((2, 4), ('x', 'y'), axis_types=(jax.sharding.AxisType.Explicit,)*2)

with jax.sharding.use_mesh(mesh):
  inputs = reshard(np.zeros((16, 128, 7)), jax.sharding.PartitionSpec('x', 'y'))
  # Conv1D across sharded y-axis:
  _ = jax.lax.conv_general_dilated(
      inputs,
      np.zeros((5, 7, 11)),
      window_strides=(1,),
      padding='SAME',
      feature_group_count=1,
      lhs_dilation=(1,),
      rhs_dilation=(1,),
      dimension_numbers=('NWC', 'WIO', 'NWC'),
  )
  
  # Max pooling along sharded y-axis.
  _ = jax.lax.reduce_window(inputs, -np.inf, jax.lax.max, (1,2,1), (1,2,1), 'SAME')
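
For reference, the behavior we would hope for is that these ops simply forward the input sharding to the output. A sketch of how we would check that (our own illustration; it assumes jax.typeof from recent JAX releases, which reports the sharding-annotated type under explicit mode):

  # Hypothetical check, continuing the snippet above inside use_mesh(mesh).
  out = jax.lax.reduce_window(inputs, -np.inf, jax.lax.max, (1, 2, 1), (1, 2, 1), 'SAME')
  print(jax.typeof(out))  # hoped for: something like float32[16@x,64@y,7]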

Would it be possible to support this in future releases?

Many thanks!

vdutor added the enhancement (New feature or request) label Apr 17, 2025
yashk2810 self-assigned this Apr 17, 2025
copybara-service bot pushed a commit that referenced this issue Apr 25, 2025

This sharding rule for conv_general_dilated only works when rhs is fully replicated or rhs's mesh is empty (i.e. rhs is a numpy array or jnp.array). In this case, we just forward the sharding of lhs to the output (after making sure that the sharding evenly divides out_shape).

And since reduce_window is the exact same case (i.e. lhs sharded, rhs fully replicated), do the same in its sharding rule.

Fixes #28090

PiperOrigin-RevId: 748736039
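
Restated as a conceptual sketch (hypothetical names; not the actual implementation merged in #28253): when rhs carries no sharding, the rule forwards lhs's PartitionSpec to the output, and fails if a sharded output dimension no longer divides evenly over its mesh axis.

from jax.sharding import PartitionSpec as P

def forward_lhs_sharding(lhs_spec, out_shape, mesh_axis_sizes, rhs_is_replicated):
  # The rule only applies when rhs is fully replicated (or has no mesh).
  if not rhs_is_replicated:
    raise NotImplementedError('rhs must be fully replicated')
  # Check that every sharded output dimension still divides evenly.
  for dim, axis in zip(out_shape, lhs_spec):
    if axis is not None and dim % mesh_axis_sizes[axis] != 0:
      raise ValueError(f'output dim {dim} does not divide over mesh axis {axis!r}')
  return P(*lhs_spec)  # forward the lhs sharding to the output

# e.g. for the reduce_window call in the issue:
# forward_lhs_sharding(P('x', 'y'), (16, 64, 7), {'x': 2, 'y': 4}, True) -> P('x', 'y')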