We have a combined experience of over eight years within custom web development and e-commerce software. For the last five years, our focus has mostly been around web data extraction and web analytics.


Some questions regarding setup of the Amazon Data Extraction Engine

Q: How many Amazon products can your system scrap per hour? I see that your system takes about 2.4 seconds per article. Is it possible to use another proxy to scrap articles even faster?

A: Yes. The speed is totally a matter of website capacity. You can put it into 100 threads, which will update 50 products per second. The software is designed to update price and stock as fast as you require. To run 100 threads, you will need a dedicated server.

Q: How long is the license when I buy it?

A: There are no license limitations. You get it and it works on your server. We provide new versions as long as the project continues.

If you need customizations, we provide them for an additional price. Since the software is not simple and depends on the scraping websites structure, it usually requires some support. And most of our clients eventually go to a small monthly payment for a timely support. Or you can handle it with your own developers.

Q: Can the data be put in the mysql database? Or can the scheme be adapted to the other webshop?

A: The data is stored in the mysql database. We install the software on your server and you can get a direct access to the database. It is possible to build some connectors, which will deploy data for your webshop.

Q: Is there any way that you do not have to manually synchronize the product data? The first time there is only ASIN, I need not have to do it manually and download everything.

A: The data is synced by the cron script, so the product data is updated automatically. Yes, asin goes first; it will populate other data after the first synchronization.

Q: It takes a long time to update the products; would there be any way to update them by groups/categories in cron tasks?

A: You may add more cron commands to the tab to speed up the synchronization process. The link should be like this: {} The option a=x makes a delay in seconds before the process starts. This way we can start more processes every minute.

Q: I need to download the data four times a day. Can you explain how to do that? How can I schedule the automatic csv download to another server or ftp?

A: We can set it up in a frame of the service. We can create a controller, which can be triggered by cron and send data to a server. Or you can call the link which generates CSV files.

Q: When does an active product go to the state “move to active”?

A: "Move to active" is one of the sync statuses, and you control in which status the product is. When you import products, you can specify the status.

We have created several statuses to distinguish products. You can import products into “Move to active”; they will be synced once. You can review their prices and decide if you need them in Active state.

Active means full syncing state. For example, move to active products ignore parsing offers settings and sync only with product page.

Q: In what order do you synchronize products again? The oldest ones first?

A: Once a product is synced, it gets next_update_date field put into the future date. Once this date becomes past, the system can put it into the queue of the next update. If the queue moves fast enough, usually the queue is not big; i.e., parser updates products right in time. Regarding next_update_date, each product also has a field sync_speed. Some products might be updated more often, though this depends on how often each product is likely to change price and stock. If price and stock do not change, the product is updated with bigger delays.

What is Amazon Data Extraction Engine?

  • High performance tool for crawl, extract and parse Amazon content.
  • Reliable extraction of actual price and quantity
  • You can run several million queries every day.

Go to contentsBack to previous page