How does the locomotive collector collect content that has not loaded in the webpage?
Octopus collector, unlike locomotive collector, is a full-featured internet data collector that is simple to operate and widely applicable. It can collect content that has not yet loaded in a web page through intelligent identification and flexible custom collection rules. The general collection steps are as follows:

1. Open Octopus collector and create a new collection task.
2. In the task settings, enter the URL to be collected as the starting URL.
3. Configure the collection rules. You can use the intelligent identification function to let Octopus identify the page's data structure automatically, or you can set the rules manually.
4. If you set the rules manually, use the mouse to select the data elements on the page and define a collection rule for each, so that the required data is captured correctly.
5. Set the page-turning rules. If you need data from multiple pages, Octopus can turn pages automatically to get more data.
6. Run the collection task. After confirming that the settings are correct, start the task and let Octopus begin collecting.
7. Wait for the collection to finish. Octopus captures the data on each page according to the rules you set, then saves it locally or exports it to the designated database.

Octopus collector's data collection capabilities can help users collect many kinds of web data easily. To learn more about its functions and usage, see the Tutorial and Help sections of the official website.
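As a rough illustration of the general steps above (starting URL, collection rule, page-turning rule, run, export), here is a minimal Python sketch. It is not how Octopus collector works internally, which the answer does not describe. The names `PAGES`, `RuleParser`, `run_task`, and `to_csv`, the `item` and `next` class names, and the example.com URLs are all assumptions made for illustration. The sketch reads static HTML from an in-memory dictionary instead of the network, so it does not handle content that a page loads later with JavaScript; that would need a browser-driven fetch in place of the `fetch` argument.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical in-memory "site" standing in for real HTTP responses.
PAGES = {
    "https://example.com/list?page=1": (
        '<ul><li class="item">Alpha</li><li class="item">Beta</li></ul>'
        '<a class="next" href="https://example.com/list?page=2">Next</a>'
    ),
    "https://example.com/list?page=2": '<ul><li class="item">Gamma</li></ul>',
}


class RuleParser(HTMLParser):
    """Applies two rules to one page: an item rule and a page-turning rule.

    Assumes matched item elements contain plain text only (no nested tags).
    """

    def __init__(self, item_class, next_class):
        super().__init__()
        self.item_class = item_class
        self.next_class = next_class
        self.items = []
        self.next_url = None
        self._in_item = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if self.item_class in classes:
            # Collection rule: start capturing the text of this element.
            self._in_item = True
            self.items.append("")
        elif tag == "a" and self.next_class in classes:
            # Page-turning rule: remember where the next page is.
            self.next_url = attrs.get("href")

    def handle_endtag(self, tag):
        self._in_item = False

    def handle_data(self, data):
        if self._in_item:
            self.items[-1] += data


def run_task(start_url, item_class, next_class, fetch, max_pages=50):
    """Follows next-page links from start_url and returns all collected items."""
    rows, url, seen = [], start_url, set()
    while url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        parser = RuleParser(item_class, next_class)
        parser.feed(fetch(url))
        parser.close()
        rows.extend(text.strip() for text in parser.items)
        url = parser.next_url
    return rows


def to_csv(rows):
    """Exports the collected items as CSV text with a single 'item' column."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["item"])
    writer.writerows([row] for row in rows)
    return buf.getvalue()


rows = run_task("https://example.com/list?page=1", "item", "next", PAGES.__getitem__)
print(to_csv(rows))
```

Passing `fetch` as a parameter keeps the rule logic separate from how pages are retrieved, so the same task definition could be pointed at a real HTTP client or a headless browser without changing the rules.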