-
Hello, We are beginners with lychee and we would need a bit of advice. We started to add the GitHub action in our repository which mostly contains jupyter notebooks (in case you're not familiar, it's basically a list of "cells", each cell containing python code and the result of its evaluation, stored in a JSON document) and Markdown files. It helped us to spot a few dead links and some that could be upgraded to HTTPS. However, there are also some false positives (see gw-odw/odw#26) and I would like to know the best way to silence them. We have 3 main categories:
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
Welcome to lychee! 👋 I took a look at the repo and can see the false positives. At the moment, we extract links from Jupyter Notebook files using a simple plaintext extractor, which explains the false-positives. For deeper inspection, we'd need to parse the file format. There is a crate for that! I've opened #1659 for that. In the meantime, you can exclude the broken links with a custom # Issues with placeholder links in Jupyter Notebook files.
# Remove once https://github.com/lycheeverse/lychee/issues/1659 got resolved.
https://dcc.ligo.org/public/0182/T2200137/001/O3bPE_downsampled.tar.gz/n
https://dcc.ligo.org/public/0157/P1800370/005/GW150914_GWTC-1.hdf5/n
https://raw.githubusercontent.com/gw-odw/odw/main/Tutorials/Day_3/toy_model.csv/n
https://github.com/gw-odw/odw/raw/main/Tutorials/Day_2/Data/PyCBC_T2_1.gwf/n
https://github.com/gw-odw/odw/raw/main/Tutorials/Day_2/Data/PyCBC_T2_2.gwf/n
https://github.com/gw-odw/odw/raw/main/Tutorials/Day_2/Data/PyCBC_T3_0.gwf/n
# Legitimate exclude; not a bug.
https://lscsoft.docs.ligo.org/bilby/examples.html
# In the document, `%7B%7D` this is actually `{}`. This is hard to fix with lychee since it can't understand placeholders.
# So this is another valid exclude.
https://github.com/gw-odw/odw/raw/main/Tutorials/Day_2/Data/%7B%7D/
# Exclude all file-links. Workaround until https://github.com/lycheeverse/lychee/issues/1646 is resolved
file:// A few things I like to do:
Hope that helps. |
Beta Was this translation helpful? Give feedback.
Welcome to lychee! 👋
I took a look at the repo and can see the false positives.
At the moment, we extract links from Jupyter Notebook files using a simple plaintext extractor, which explains the false-positives. For deeper inspection, we'd need to parse the file format. There is a crate for that! I've opened #1659 for that.
In the meantime, you can exclude the broken links with a custom
.lycheeignore
: