Web scrapping from Korean Brand website - http://kdtj.kipris.or.kr/kdtj/searchLogina.do?method=loginTM#page10
node.js
and puppeteer
- Clone repository from git
npm install
to install dependenciesnpm run start
to run node.js server
Data will be saved as brands/[pagenumber].pdf
- Pagination skip every 10 pages
- Wait for image downloaded to the browser cache
- Start from specific page
- Error handling - extraction failed should stop process and notify to user