GitHub

Below is a general guide on how to use and run these two scripts, ExportThreadWithoutReplies.py and ExportReplies.py. Both scripts interact with the Slack API to retrieve data from a specific Slack channel, but each focuses on different parts of the conversation.

1. Prepare Your `config.json`

Create a config.json file in the same directory as the scripts with the following format:

{
    "SLACK_TOKEN": "xoxb-1234-...",
    "SLACK_COOKIE": "YourCookieStringHere",
    "CHANNEL_ID": "CXXXXXX"
}

Where to find these values:

Channel ID
- Open Slack (the web version) and navigate to the channel you’re interested in.
- Look at the URL in your browser, for example:
```
https://app.slack.com/client/TXXXXXX/CYYYYYYY
```
  The part after /client/ and TXXXXXX/ is CYYYYYYY. That’s the Channel ID.
- Alternatively, if you see a link in the form https://slack.com/app_redirect?channel=CYYYYYYY, then CYYYYYYY is the Channel ID.
Slack Token and Cookie
- Open your browser’s Developer Tools → Network tab.
- Navigate (or refresh) the Slack channel to see all network requests.
- Look for a request containing conversations.history or any Slack API endpoint.
- Right-click on the request → Copy as cURL.
- Paste the cURL command into Postman or a text editor to inspect its headers.
- In the headers, find Authorization: Bearer xoxb-... (this is your Slack Token).
- Find the Cookie: ... header value (this is your Slack Cookie).
- Copy both into your config.json as shown above.

2. ExportThreadWithoutReplies.py – Fetch Main Thread Messages

What does it do?
- This script fetches main thread messages (messages with reply_count > 0, meaning they have replies) from the given Slack channel.
- It saves the results in a file named threads.json.
How to run it:
1. Make sure your config.json file is properly filled in (Token, Cookie, Channel ID).
2. Open a terminal/command prompt in the same directory as ExportThreadWithoutReplies.py and run:
```
python ExportThreadWithoutReplies.py
```
3. The script will connect to Slack and gather all thread-starting messages.
4. When finished, you should see a new file called threads.json in your directory.
Output Files:
- threads.json: Contains a list of JSON objects, each representing a main thread message (including ts, text, reply_count, etc.).

3. ExportReplies.py – Fetch Replies to Each Thread

What does it do?
- This script uses the threads.json file created by ExportThreadWithoutReplies.py to find each thread’s ts (timestamp).
- Then, for each thread, it retrieves all replies (the messages inside that thread).
- It stores those replies in replies.json and also keeps track of progress in a file called progress.json (so it can resume if interrupted).
How to run it:
1. Run it only after you have successfully run ExportThreadWithoutReplies.py (so you have threads.json).
2. In a terminal/command prompt, run:
```
python ExportReplies.py
```
3. The script loads threads.json, iterates over each thread, and fetches replies from Slack.
4. Replies are saved to replies.json. Progress is logged in progress.json.
Output Files:
- replies.json: Contains all fetched replies from all threads in the channel.
- progress.json: Tracks the last processed thread index.
  - If you re-run the script, it checks progress.json to skip re-fetching.
  - At the end of a successful run, it may reset to 0 (depending on the version of the script you have) so you can safely export another channel if needed.

4. Workflow Summary

Configure: Ensure config.json is filled out correctly with your Slack Token, Cookie, and Channel ID.
Run ExportThreadWithoutReplies:
```
python ExportThreadWithoutReplies.py
```
- Wait for it to finish.
- Check that threads.json has been created and contains your thread data.
Run ExportReplies:
```
python ExportReplies.py
```
- This will read threads.json and fetch all replies (per thread).
- Check replies.json for the collected reply data.
- If you stop it halfway through, re-run to resume from where it left off.
Check your files:
- threads.json contains the main threads.
- replies.json contains all the replies for those threads.

5. Troubleshooting

Authentication Errors:
- If you receive 401/403 errors or Slack says “invalid_auth”, re-check your Slack Token and Cookie in config.json.
Rate Limits (429):
- The script automatically handles Slack’s rate limiting by waiting when it detects a 429 error. If Slack does not send a Retry-After header, the script will print a message so you can wait manually.
Missing Data:
- Make sure your Slack user account has the necessary permissions for reading conversation history in that channel.

By following these steps, you’ll export Slack threads (main messages) and all of their replies into JSON files.

🆕 New Features v2.0

1. Automatic `since_ts` Tracking

Reads the last processed message’s ts value from since_ts.txt.
If the file is missing or empty, pulls the entire history on first run.
After fetching, writes the highest ts back to since_ts.txt.
Ensures each subsequent run only processes newer messages.

2. Parallel Permalink & Reply Retrieval

Uses concurrent.futures.ThreadPoolExecutor to fire up to 10 concurrent chat.getPermalink calls for thread-starter messages.
Replies fetching script uses up to 5 parallel workers against conversations.replies.
Results are sorted by ts to preserve chronological order.

3. Robust Rate-Limit Handling

On HTTP 429 (rate-limit), reads Retry-After header and sleeps before retrying.
Applies this logic uniformly across conversations.history, chat.getPermalink and conversations.replies endpoints.
Prevents hard failures on large channels or big threads.

4. Resume-able Progress Tracking

Replies script records “last processed index” in progress.json after each thread.
On restart, resumes from that index—no need to re-fetch already handled threads.
Once all threads are done, progress.json resets to zero for the next full export.

5. File Outputs

Threads:
- threads.json (full JSON dump)
- Optional: threads.csv for spreadsheet-friendly output
Replies:
- replies.json (accumulated thread replies)
Tracker Files:
- since_ts.txt (last ts checkpoint)
- progress.json (reply-fetch index)

Example Workflow

First Run:
- since_ts.txt is empty or missing → fetch all threads via conversations.history.
- Write the newest ts into since_ts.txt.
Subsequent Runs:
- Read since_ts → fetch only threads newer than that value.
- Replies script picks up from its progress.json checkpoint.
Inspecting Results:
- Use threads.json & replies.json to see a fully chronological export of all threads and their replies.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitignore		.gitignore
ExportReplies.py		ExportReplies.py
ExportThreadWithoutReplies.py		ExportThreadWithoutReplies.py
README.md		README.md
config.json		config.json
dataTimeConverter.py		dataTimeConverter.py
since_ts.txt		since_ts.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

1. Prepare Your `config.json`

Where to find these values:

2. ExportThreadWithoutReplies.py – Fetch Main Thread Messages

3. ExportReplies.py – Fetch Replies to Each Thread

4. Workflow Summary

5. Troubleshooting

🆕 New Features v2.0

1. Automatic `since_ts` Tracking

2. Parallel Permalink & Reply Retrieval

3. Robust Rate-Limit Handling

4. Resume-able Progress Tracking

5. File Outputs

Example Workflow

About

Uh oh!

Releases

Packages

Languages

kemalaltun/AltunSlackExporter

Folders and files

Latest commit

History

Repository files navigation

1. Prepare Your config.json

Where to find these values:

2. ExportThreadWithoutReplies.py – Fetch Main Thread Messages

3. ExportReplies.py – Fetch Replies to Each Thread

4. Workflow Summary

5. Troubleshooting

🆕 New Features v2.0

1. Automatic since_ts Tracking

2. Parallel Permalink & Reply Retrieval

3. Robust Rate-Limit Handling

4. Resume-able Progress Tracking

5. File Outputs

Example Workflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

1. Prepare Your `config.json`

1. Automatic `since_ts` Tracking

Packages