-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Errors when running crawler behind corporate proxy #1094
Comments
It seems like this is happening when the crawler uses the proxy to connect to the api. We probably need a different variable name for that case. |
That what we set as a proxy setting:
|
Could you try removing the HTTP_PROXY variable, it should be the one used for the connections from the crawler to the api. |
Ok, tried, looks like the connection can be established now, but getting SSL errors: Any hints on configuring SSL certs for crawler (since there is a replacement of SSL cert with our org signed certificate, when going through the proxy) or maybe any way to run crawler in insecure mode? |
Alright, thanks. SSL is implemented by openssl, so setting |
gotcha, thank you. I'll work on that tomorrow, since will need to update a deployment yaml and mount ssl certs somewhere as a secret |
The related change is merged. New container image should be published soon. https://github.com/change-metrics/monocle/actions/runs/7199715334 |
Hello, I added a certificate to a deployment and set the env var to:
Then tested with curl and connection works fine via proxy:
But crawler still receives the following: Is it possible set it to insecure? Appreciate any further suggestions |
Perhaps you can try setting |
ah, looks like it is TLS_NO_VERIFY variable, as per: https://github.com/change-metrics/monocle/blob/659e4c319b3b6c37777ae692952c7250448e7319/src/Monocle/Client.hs#L47C28-L47C41 |
Any idea why I get the a "Network error" from web UI (api), when trying to access it via browser Logs of api service not throwing any suspicious errors and moreover it that I received 200: I've exposed the service via Cloud Load Balancer on GCP GKE, with LoadBalancer service type:
|
Have you try setting |
yep, set that for api and crawler, but still getting the same "Network error" message |
Oops I meant |
Sorry can't give you a screenshots, but while looking into Chrome developer tools, I see the following for "about" request:
Request Headers:
|
Awesome, that did the trick. Thank you very much @TristanCacqueray ! Feel free to close this issue record |
You're welcome, have fun! |
Hello,
Thank you for helping out with a question about proxy settings for crawler. Now when I'm trying to run a test indexing, I'm getting the following error:
2023-12-12 21:01:48 WARNING Macroscope.Main:317: Skipping due to an unexpected exception {"index":"test","crawler":"coder","err":"Decoding of CommitInfoRequest {commitInfoRequestIndex = "test", commitInfoRequestCrawler = "coder", commitInfoRequestEntity = Enumerated {enumerated = Right EntityTypeENTITY_TYPE_ORGANIZATION}, commitInfoRequestOffset = 0} failed with: "Error in $: Failed reading: not a valid json value at '<!DOCTYPEhtmlPUBLIC-W3CDTDXHTML1.0TransitionalENhttp:www.w3.orgTRxhtml1DTDxhtm'"\nCallStack (from HasCallStack):\n error, called at src/Relude/Debug.hs:289:11 in relude-1.2.0.0-Jiwa4gfuZvkK1snRof3V:Relude.Debug\n error, called at src/Monocle/Client.hs:107:17 in monocle-0.1.10.0-1juCsBb4vJ35WvYo0D138g:Monocle.Client"}
Here is a config:
workspaces:
- name: test
crawlers:
- name: "coder"
provider:
github_organization: coder
update_since: '2023-01-01'
Any idea what that would mean?
Appreciate any hints
The text was updated successfully, but these errors were encountered: