Lin Hsin Hsin on Spiders, Scrapers, Crawlers & Bots from Lin Hsin Hsin AI CENTER-- the first person in the world who authored the phenomenon of cryptocurrency Oct 3, 1996. An ENCRYPTION Specialist , Founder of FIRST VIRTUAL MUSEUM in the WORLD -- 27th Anniversary of LIN HSIN HSIN ART MUSEUM -- Digital Art Museum, First Virtual Museum in the World - 1994. Wikipedia, Digital Media Center: Technology, Digital Art, Digital Paintings, Digital Sculptures, Digital Music, Digital Musical Instruments, Sound, , Animated Music, Web-enabled, Interactive, Digital Media Poineer

Lin Hsin Hsin Artificial Intelligence Center

Spiders, Scrapers, Crawlers & Bots

What is a WEB CRAWLER?

Definition

A web crawler is a digital search engine BOT that uses:

🛠 copy & metadata
to discover & index web pages

Types of Web Crawlers by Functionality:

🛠 Focused web crawler
🛠 Incremental web crawlers
🛠 Distributed crawlers
🛠 Parallel crawlers
🛠 Deep web crawlers
🛠 Screen scrapers

Web Crawlers by Classifications:

Used by the search engine to:

🎯 Crawl websites
🎯 View images
🎯 View links
🎯 Index them on the internet

Commercial Bots

Used by some SEO websites to provide users with SEO reports of a selected website so as to solve any SEO issues on the site

Examples:

🕸️ Ahrefsbot by ahref.com
🕸️ SemrushBot by Semrush.com
🕸️ Barkrowler by Babbar.tech

Feed Fetchers Bots

Used to collect thumbnails & titles of the contents to display on their website

Examples:

Facebook external hit – used by the Facebook website
Twitter bot – used by Twitter

Monitoring Bots

Used to check the performance of the websites performances 🎯 uptime
🎯 pinback

What is a SPIDER?

Definition

A web spider is similar to a crawler but it is more focused on indexing the textual content of a web page Itnis deployed by search engines to scan & index the web

What is a SCRAPER?

Definition

📍 A web scraper is a program or script that EXTRACTS specific data from websites
📍 Unlike crawlers, which collect information about websites
scrapers are focused on the CONTENTS of the site, EXTRACTING:

📝 texts
🖼 images
📹 videos
🗣 audio
💲 prices
🎯 any other specific elements