Releases: extractus/article-extractor
Releases · extractus/article-extractor
v7.2.17
- Merge pr #350 by @LarchLiu
- Add
agent
to fetchOptions
- Update CI to test with Node 20
- Update dependencies
- Update README
Example article extraction via proxy server with agent
import { extract } from '@extractus/article-extractor'
import { HttpsProxyAgent } from 'https-proxy-agent'
const proxy = 'http://abc:[email protected]:31113'
const url = 'https://www.cnbc.com/2022/09/21/what-another-major-rate-hike-by-the-federal-reserve-means-to-you.html'
const article = await extract(url, {}, {
agent: new HttpsProxyAgent(proxy),
})
console.log('Run article-extractor with proxy:', proxy)
console.log(article)
v7.2.16
- Fix issue #347
- Update dependencies
v7.2.15
- Merge with changes from pr #341
- Fix unsupported package
string-similarity
- Update deps
v7.2.14
- Add support parsely meta tags
Maybe it comes from Parse.ly. Our users found that serveral websites such as TheVerge start using the strange meta tags that may break the extraction process. With these non-standard resources, this release should be helpful.
v7.2.13
- Fix issue while fetching data from some websites (Deno platform only)
v7.2.12
- Set default user-agent
- Avoid error if
parserOptions
is null
- Update dependencies
v7.2.10
- Fix issue #331
- Update dependencies
- Remove unnecessary watermark
v7.2.9
- Fix issue #329
- Update dependencies
- Improve unit test
v7.2.8
- Expose new API method extractFromHtml()
- Update dependencies
- Change coding style (remove standardjs)
Related issues: #321, #326