Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Distributed Calculations #1

Open
JamesPHoughton opened this issue Jan 26, 2016 · 1 comment
Open

Distributed Calculations #1

JamesPHoughton opened this issue Jan 26, 2016 · 1 comment

Comments

@JamesPHoughton
Copy link
Collaborator

Optimization, Monte Carlo, etc. are all analyses that could benefit from running models in parallel.

We have a basic demonstration of parallel model fitting, where we fit the model independently to a lot of datasets: http://pysd-cookbook.readthedocs.org/en/latest/analyses/fitting/Massively_Parallel_Fitting.html

But we should show how these can be distributed across multiple computers. This is facilitated by the fact that pysd and its dependencies are pure-python, and can be pickled and distributed.

Here are a few resources for parallel computing in python:

Distribution frameworks:

  • multiprocessing
    • Part of standard python library
    • Runs easily in interactive mode
    • Basically limited to the cores on one host
  • celery
  • scoop
    • + Really clean API
    • - Doesn't play well with ipython/jupyter
  • jug
@JamesPHoughton JamesPHoughton changed the title Distributed Optimization Distributed Calculations Jan 29, 2016
@JamesPHoughton
Copy link
Collaborator Author

ipyparallel

Pros:

  • Works well with ipython

Cons:

  • Complicated setup

Resources:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant