You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Encountered an issue using the Abot2 package while crawling webpages. The httpResponse property of the crawled page (entity.CrawledPage.HttpResponseMessage) intermittently returns null for some pages.
Key Observations
This issue does not occur for all pages but only for certain ones.
The number of pages with a null httpResponse varies between different crawl runs.
Expected Behavior
The HttpResponseMessage should provide the HTTP response for all crawled pages.
Actual Behavior
The HttpResponseMessage is null for some pages, and the occurrence of these pages is inconsistent between crawl runs.
Additional Information
Error Message: When the issue occurs, the following error is logged:
The SSL connection could not be established, see inner exception.
Inner Exception: Unable to read data from the transport connection: An existing connection was forcibly closed by the remote host.
This issue appears to be related to handling HTTPS connections or certain server configurations.
Steps Already Taken
Verified SSL/TLS settings and configurations.
Checked network connectivity and ensured the target URLs are reachable.
Observed that this issue is not URL-specific but varies across crawl runs.
Executed crawler with different configuration, below are some config which have been used config 1:
Encountered an issue using the Abot2 package while crawling webpages. The httpResponse property of the crawled page (entity.CrawledPage.HttpResponseMessage) intermittently returns null for some pages.
Key Observations
This issue does not occur for all pages but only for certain ones.
The number of pages with a null httpResponse varies between different crawl runs.
Expected Behavior
The HttpResponseMessage should provide the HTTP response for all crawled pages.
Actual Behavior
The HttpResponseMessage is null for some pages, and the occurrence of these pages is inconsistent between crawl runs.
Additional Information
Error Message: When the issue occurs, the following error is logged:
The SSL connection could not be established, see inner exception.
Inner Exception: Unable to read data from the transport connection: An existing connection was forcibly closed by the remote host.
This issue appears to be related to handling HTTPS connections or certain server configurations.
Steps Already Taken
Verified SSL/TLS settings and configurations.
Checked network connectivity and ensured the target URLs are reachable.
Observed that this issue is not URL-specific but varies across crawl runs.
Executed crawler with different configuration, below are some config which have been used
config 1:
Config 2:
Would appreciate assistance in diagnosing and resolving this issue, or confirmation if this is a known bug
The text was updated successfully, but these errors were encountered: