Web Scraping Specialist

Company Description
Foresiet is an AI-enabled SaaS-based cybersecurity company that has developed an innovative One Click Digital Risk Protection platform. Our platform proactively detects, monitors, and secures identity, data, and asset threats from the surface, deep, and dark web. Combining human intelligence and applied research, we protect individuals, enterprises, and the Federal Government.

About The Role:
As a Web Scraping Specialist, you will play a pivotal role in building scalable, high-performance web scrapers. You will draw on your experience with technologies such as Redis, Elasticsearch, Python, Scrapy, Selenium, and Playwright to drive innovation and deliver high-quality products.

What You'll Do:
- Build solutions for scraping data from various sources and for processing large, unstructured data sets, including extracting and ingesting data from websites.
- Own the creation of the tools, services, and workflows that improve crawl/scrape analysis, reporting, and data management.
- Test scraped data and the scrapers themselves to ensure accuracy, quality, and compliance.

What You'll Need:
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- Experience with tools like BeautifulSoup, Selenium, Scrapy, and headless browsers to automate intricate data extraction tasks, even from protected or dynamically loaded web pages.
- Experience automating the bypass of common web barriers using techniques such as proxy rotation, user-agent and device-fingerprint modification, and CAPTCHA solving.
- Proven track record as a web scraping specialist working with Python, Scrapy, Selenium, and Playwright for at least 1-2 years.
- Extensive experience building security and network-related products is highly desirable.
- Hands-on experience with MongoDB, Redis, Elasticsearch, and other relevant technologies.
- Hands-on experience with Scrapy, Selenium, and Playwright.
- Hands-on experience with background and parallel task processing and related technologies.
- Familiarity with the Linux operating system (Ubuntu/Debian).
- Familiarity with cloud platforms such as AWS, GCP, or Azure.
- Experience with Git, Docker, and Ansible.
- Excellent communication, leadership, and collaboration skills.
- Experience working in an agile development environment.
- Ability to work independently and remotely.

Connect with us at [Confidential Information]