Ticket #106 (closed enhancement: wontfix)
Merge parse and crawl commands
| Reported by: | pablo | Owned by: | pablo |
|---|---|---|---|
| Priority: | major | Milestone: | 0.10 |
| Component: | code | Version: | |
| Keywords: | Cc: | dan pablo |
Description (last modified by pablo) (diff)
There is a somewhat hacky parse command to fetch specific pages, parse them and show the results.
The problem with this parse command is that it runs outside the crawl loop, and thus it could return different results. For example, if a spider returns a particular requests, it won't follow then (when using the parse command).
We need to "integrate" better the parse command inside the engine crawling loop to provide results more similar to those that would be obtained by crawling with the crawl command.
Change History
Note: See
TracTickets for help on using
tickets.
