Skip to content

Apify actor-crawler providing an API for the transfermarkt.com website.

License

Notifications You must be signed in to change notification settings

lewis-conroy/actor-transfermarkt

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

act-transfermarkt

Apify actor-crawler providing an API for the transfermarkt.com website.

This actor can extract data from transfermarkt.com pages. It works best with competition, club or player pages, but is also capable of extracting some data from most (if not all) other page types. It requires only the start page url, it will determine the page type by itself and extract the data accordingly.

INPUT

Input is a JSON object with the following properties:

{ 
    "startUrl": START_URL,
    "parallels": PARALLEL_CRAWLERS,
    "crawlDepth": MAX_CRAWLING_DEPTH,
    "pageDepth": MAX_PAGINATION_DEPTH,
    "puppeteerOptions": LAUNCH_PUPPETEER_OPTIONS
}

startUrl is the only required attribute. This is the start page URL.
parallels specifies how many parallel crawlers will be used, defaults to 1.
crawlDepth defines how deep the crawler will navigate from current page, by default it will extract only the start page. pageDepth defines how many pages in the pagination it will navigate to, by default unlimited.
puppeteerOptions is a PuppeteerCrawler parameter launchPuppeteerOptions.

About

Apify actor-crawler providing an API for the transfermarkt.com website.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 93.9%
  • Dockerfile 6.1%