-
Notifications
You must be signed in to change notification settings - Fork 464
[TransferEngine] Fix QP leak in EndPointStore #1132
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
mooncake-transfer-engine/src/transport/rdma_transport/worker_pool.cpp
Outdated
Show resolved
Hide resolved
| // if (!endpoint->active()) { | ||
| // if (endpoint->inactiveTime() > 1.0) | ||
| // context_.deleteEndpoint( | ||
| // entry.first); // enable for re-establishation | ||
| // for (auto &slice : entry.second) | ||
| // failed_slice_list.push_back(slice); entry.second.clear(); | ||
| // continue; | ||
| // } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The same
|
Does this PR resolve the limited QP number issue? |
This patch will eagerly destroy QPs if it is possible. So I think it's helpful to mitigate this issue. However Endpoint capacity is remained. |
|
Full review is required before merging this PR. |
Description
Type of Change
How Has This Been Tested?
This patch tries to mitigate problems from EndPoint Store management, including:
Simple test in local testbed.
Checklist