Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

scanner runs very slow / no status #3296

Closed
nupis-FrankO opened this issue Nov 6, 2020 · 7 comments
Closed

scanner runs very slow / no status #3296

nupis-FrankO opened this issue Nov 6, 2020 · 7 comments
Labels
scanner About the scanner tool

Comments

@nupis-FrankO
Copy link

Hi,

we are testing ORT at the moment to scan/analyze a maven project.

The scan runs very slow (> 1 day) and we don't see any state of the current execution. Is it possible to implement a "progress bar"?

Or is there a possibility to optimize the execution of the scan process?

Regards,

Frank

@sschuberth
Copy link
Member

I have also seen bad performance for scanning large Maven projects recently, and we're planning to investigate these, as even with debug output sometimes nothing seem to happen for minutes. Meanwhile, can you try increasing the max heap size for ORT to something like 16 GB (-Xmx16g) to see if that helps?

@mnonnenmacher mnonnenmacher added the scanner About the scanner tool label Nov 9, 2020
@sachinshaji
Copy link

HI, We are also facing the same problem. Scanner takes lot of time.

@sschuberth
Copy link
Member

The scanner (i.e. ScanCode, by default) taking a lot of time is nothing ORT can change. However, we could probably try to do a better job at reporting progress (although you actually do get per package progress with --info).

In any case, you should set up a scan storage to benefit from existing scan results. Giving the ClearlyDefined scan storage a try is something you could do, for example.

@sachinshaji
Copy link

Thanks for the replay.
I have build a docker image form the Dockerfile given in the repo. I am triggering this command to scan the code
sudo docker run -v $PWD/:/project --info scan -i /project/analyse/analyzer-result.yml -o /project/scanner/scanner-result.json

We are not defining any storage backend, i guess this will default goes to filesystem.
Any suggestion to improve the scanning time? It took around 3 to 4 hours to scan the entire code.

@sschuberth
Copy link
Member

We are not defining any storage backend, i guess this will default goes to filesystem.

Correct. And this only speeds up the scanning for consecutive scans that involve mostly the same packages, as you're populating our file-based scan storage yourself.

Any suggestion to improve the scanning time?

The idea is that you either quickly build up some company-internal (Postgres-based) scan storage yourself to speed up future scans, or to use an existing public scan storage, like the one from ClearlyDefined as mentioned above. However, might be that we currently have an issue there.

@sachinshaji
Copy link

Thanks a lot for your help

@sschuberth
Copy link
Member

Closed as part of backlog grooming. Feel free to comment if you would like to contribute to this.

@sschuberth sschuberth closed this as not planned Won't fix, can't repro, duplicate, stale May 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
scanner About the scanner tool
Projects
None yet
Development

No branches or pull requests

4 participants