Timeline


and .

10/06/08:

15:12 Changeset [309:316e231b660e] by elpolilla

Modified ItemDeltas? to work with RobustScrapedItems? instead of ...

15:00 Ticket #9 (Define Scrapy core components) closed by pablo
fixed: In the last meeting with the team we decided that the Scrapy core will …
13:19 Ticket #13 (Write homepage introduction) created by pablo
We need a introduction to Scrapy in our homepage. It should describe …
13:18 Ticket #12 (Define a mechanism for editing homepage pages) created by pablo
Define a mechanism to edit homepage pages. We're currently thinking …
13:15 Ticket #11 (Write usable basic spider class) closed by pablo
fixed: Added BasicSpider? in r301. It will probably need need some further …
13:12 Ticket #11 (Write usable basic spider class) created by pablo
Right now we only have the BaseSpider? class which doesn't provide any …
13:08 Ticket #10 (Add some more usable Link Extractors) closed by pablo
fixed: Added RegexLinkExtractor? in r302
13:06 Ticket #10 (Add some more usable Link Extractors) created by pablo
Add a more usable Link Extractor to define how a spider should following …
13:04 Ticket #9 (Define Scrapy core components) created by pablo
Define the core components of Scrapy and sort of the code so they remain …
12:59 Ticket #8 (Write scrapy tutorial for creating your first spider) created by pablo
Write a simple tutorial for a simple first spider. Perhaps scraping Google …
12:56 Ticket #7 (Write some basic adaptors) created by pablo
Write some basic adaptors for the most common tasks like extraction (from …
09:58 Changeset [308:29d0c1bbf719] by Andres Moreira <elkpichico@…>

Fixed bug for unicode support.The empty string () in some platforms is ...

08:14 Changeset [307:f87b1a2f1f70] by elpolilla

Added ItemDelta? objects and modified replays to make use of them

01:23 Changeset [306:d9d6230095ab] by Pablo Hoffman <pablo@…>

removed unneeded DEFAULT_DATA_ENCODING and commented COMMANDS_MODULE in ...

01:22 Changeset [305:4597b9e1c258] by Pablo Hoffman <pablo@…>

added setadaptors method to ScrapedItem?, removed incorrect constructor

00:36 Changeset [304:53e7d3f55944] by Pablo Hoffman <pablo@…>

removed unused imports and minor bug fix to media/image pipeline

10/05/08:

16:32 Changeset [303:e01a349cffcc] by samus_

removing utf-16 xpathselector_iternodes testcase since the problem comes ...

05:57 Changeset [302:001204823c87] by Pablo Hoffman <pablo@…>

added RegexLinkExtractor? in new scrapy.link.extractors module

05:45 Changeset [301:dad9a5b25bd2] by Pablo Hoffman <pablo@…>

added scrapy.contrib.spiders module

05:39 Changeset [300:1fbebb30a107] by Pablo Hoffman <pablo@…>

added scrapy.utils.response module

05:37 Changeset [299:393ca306f33f] by Pablo Hoffman <pablo@…>

improved scrapy-admin.py script and default project template

05:35 Changeset [298:507eaefc2123] by Pablo Hoffman <pablo@…>

removed old comment

04:58 Changeset [297:2c3961f304ea] by Pablo Hoffman <pablo@…>

added nonzero method to XPathSelector

10/03/08:

16:50 Changeset [296:7c46a4dc7259] by Damian Canabal <damian.canabal@…>

added NotSupported? Exception

11:37 Changeset [295:2a219c647af0] by Andres Moreira <elkpichico@…>

Added test for utils/markup.py. Added support to unicode to the new markup ...

08:57 Changeset [294:7b8bf2670ca1] by Andres Moreira <elkpichico@…>

Added new functions to parse html.

01:14 Changeset [293:e8036f5f34ca] by Pablo Hoffman <pablo@…>

added another test for safe_url_string function

10/02/08:

19:41 Changeset [292:8d11ca2543c2] by olveyra

reverted an experimental code that should have been commited

17:39 Changeset [291:1f203db25492] by olveyra

added DOWNLOAD_DELAY comment in settings template

16:59 Changeset [290:1e28d7fdd693] by olveyra

added support for global DOWNLOAD_DELAY setting

16:22 Changeset [289:206880fae84a] by olveyra

allow to directly specify which domain corresponds to a given request

09/30/08:

16:57 Changeset [288:6a7cc5a534c8] by Pablo Hoffman <pablo@…>

simplified test without loosing functionality

16:24 Changeset [287:f2736aab761e] by olveyra

added test for r284

13:56 Changeset [286:ea4294de0b24] by olveyra

better management of some redirection loops.

13:19 Changeset [285:688f07ba54d5] by olveyra

small fix

11:15 Changeset [284:87ac2d7adc10] by olveyra

safe_url_string should not escape unreserved marks (see RFC 2396, sec 2.3)

10:44 Changeset [283:fff1d4064041] by olveyra

- copied original request to response.request in get_url method - deleted ...

09/29/08:

15:14 Changeset [282:e7af2d20d017] by elpolilla

- Added the posibility of knowing the decompressed response's format, in ...

10:21 Changeset [281:149b8173258a] by Damian Canabal <damian.canabal@…>

rolled back public ent_re to private and added a function has_entities ...

09:52 Changeset [280:9716ad99f4b9] by Damian Canabal <damian.canabal@…>

changed private html entity regex to public

09/25/08:

09:49 Changeset [279:d1dd66016ea2] by samus_

added comment

09:45 Changeset [278:16ad4995eca2] by samus_

small fix to the regex

09:22 Changeset [277:9fb775f438c6] by Daniel Grana <dangra@…>

images: use brief exception logging

09/24/08:

23:14 Changeset [276:6abe09ae4b61] by olveyra

allow to override item class adaptor in constructor

15:19 Changeset [275:13f5b2b0e14e] by samus_

added support for xml-declared encodings

09:40 Changeset [274:fd20e0ee6a63] by Damian Canabal <damian.canabal@…>

added test for remove entities

09:25 Changeset [273:43df3a99077a] by Damian Canabal <damian.canabal@…>

html entities regexp improvement

09/23/08:

19:18 Changeset [272:798b3b5f1234] by Pablo Hoffman <pablo@…>

moved scrapy.core.log module to scrapy.log

18:52 Changeset [271:d1670998ba5c] by Pablo Hoffman <pablo@…>

moved scrapy.core.mail module to scrapy.mail

18:48 Changeset [270:26c4d9d1c6a8] by Pablo Hoffman <pablo@…>

added scrapy.utils.defer, moved deferred functions from scrapy.utils.misc ...

18:34 Changeset [269:fcde243f2228] by Pablo Hoffman <pablo@…>

removed location_str function incorrectly added to this module

17:36 Changeset [268:b69cb7ddaa8c] by samus_

removed duplicated function convert_entity from scrapy.utils.misc

08:46 Ticket #5 (Remove scrapy.utils.misc.unquote_html) closed by german
fixed: done in r267

09/22/08:

15:04 Changeset [267:cbd270dc632b] by german

removed unquote_html

09/19/08:

17:03 Ticket #4 (Deploy scrapy.org homepage) closed by daniel
fixed
17:00 Changeset [266:77249aed5c4c] by Matias Aguirre <matiasaguirre@…>

Change site-media with static in css file

16:57 Changeset [265:66f8d1a5c4e6] by Matias Aguirre <matiasaguirre@…>

Comment blog templatetags calls

16:53 Changeset [264:88099d4cc7f1] by Matias Aguirre <matiasaguirre@…>

Remove blog url an installed app setting

16:19 Changeset [263:f2e96b328f4e] by Matias Aguirre <matiasaguirre@…>

Remove blog application which is not django 1.0

16:17 Changeset [262:44fdb4735206] by Matias Aguirre <matiasaguirre@…>

Django 1.0 support over download application

13:57 Changeset [261:65dd393b0e25] by Daniel Grana <dangra@…>

remove decobot reference and fix test_plugin

13:57 Changeset [260:1da99f84f1f7] by Daniel Grana <dangra@…>

remove decobot reference

12:21 Changeset [259:7f8da8cfd8a3] by Daniel Grana <dangra@…>

grrr.. stupid bugfix

11:11 Changeset [258:d5a3b33c4d65] by eduardo

removed references to python2.5

09:58 Ticket #6 (Add spider argument to settings) created by pablo
Sometimes spiders need to override settings, but there is currently no way …
09:50 Changeset [257:a5f74b6dd472] by olveyra

reverted changeset 254

09/18/08:

17:19 Changeset [256:f1307eefb3f4] by olveyra

ouch! reverted a failed commit and now yes, removed an unneeded None

17:13 Changeset [255:01a553c266ca] by olveyra

removed unneeded None

17:11 Changeset [254:aa6d49861e0f] by olveyra

allow to set download delay as a scrapy setting

16:21 Changeset [253:4948d9208447] by eduardo

bump version

14:48 Changeset [252:aa5a86414c05] by eduardo

include templates

13:08 Changeset [251:dd31b26716e4] by Daniel Grana <dangra@…>

new setup.py

11:58 Changeset [250:89558d4c4488] by eduardo

in the way of setuptooling

09/17/08:

09:34 Changeset [249:a114d4344f4f] by Daniel Grana <dangra@…>

use request.deferred for cached valued too

09/16/08:

20:40 Changeset [248:2cab2509abaf] by Daniel Grana <dangra@…>

media: use deferred of request to allow adding callback from ...

20:40 Changeset [247:ffa384206046] by Daniel Grana <dangra@…>

add image pipeline that stores and can thumbs images in different sizes

20:39 Changeset [246:63ab4724fa90] by Daniel Grana <dangra@…>

move media_to_download hook to allow caching and prevent downloading

16:31 Changeset [245:19ac29207c2c] by Daniel Grana <dangra@…>

mediapipeline: change cache variable by domaininfo

11:18 Ticket #2 (Implement media pipeline) closed by daniel
fixed: implemented at …
11:03 Changeset [244:af3a21fc4eea] by Daniel Grana <dangra@…>

messy typo adding errback

08:56 Changeset [243:5cbce03f57d0] by anibal

Scrapy architecture diagram 1st version, dia source

08:30 Changeset [242:30d43e30d5e7] by Pablo Hoffman <pablo@…>

added docs dir, guess for what? :)

09/15/08:

12:37 Changeset [241:dabac5c48c3f] by Pablo Hoffman <pablo@…>

renamed scrapy.lib to scrapy.xlib

12:29 Changeset [240:eca8f9ca9e6d] by Pablo Hoffman <pablo@…>

removed simplejson from scrapy code

11:55 Ticket #5 (Remove scrapy.utils.misc.unquote_html) created by pablo
Remove unquote_html which whoose functionality is a duplicate of the …
10:55 Changeset [239:e23aaa70e373] by Daniel Grana <dangra@…>

ername methods and enforce extraction of Request objects as item media to ...

10:08 Changeset [238:e197360ad0fa] by Pablo Hoffman <pablo@…>

improved fix to support IE7 (thanks bill\!)

09:58 Changeset [237:004a147ccabe] by Pablo Hoffman <pablo@…>

fixed hack for IE6

09:56 Changeset [236:43994444805d] by Pablo Hoffman <pablo@…>

added hack for IE6

09:46 Changeset [235:272c0f22bca2] by Daniel Grana <dangra@…>

move bugtraps to avoid silence of bugs in media_downloaded or media_failed

05:57 Changeset [234:92302b2b8373] by Daniel Grana <dangra@…>

small bugfix and add bugtraps

05:27 Changeset [233:e8b316f3ec50] by Daniel Grana <dangra@…>

remove item.image_urls from get_urls_from_item

04:08 Changeset [232:986eeef678e0] by Daniel Grana <dangra@…>

add info object to public methods

03:01 Changeset [231:9efda0c9d303] by Daniel Grana <dangra@…>

Adds docstrings to public methods, remove unused imports, normalize urls ...

01:33 Changeset [230:04d10118f899] by Daniel Grana <dangra@…>

add media_to_download hook support, and return cached result if available

09/14/08:

23:34 Ticket #4 (Deploy scrapy.org homepage) created by pablo
Deploy scrapy.org Django homepage written by Matias.
23:32 Ticket #3 (Implement adaptor infrastructure) created by pablo
Implement the adaptors infrastructure and write some common basic …
23:30 Ticket #2 (Implement media pipeline) created by pablo
Implement media pipeline as discussed in our meetings, and based on the …
22:23 Changeset [229:03643a610439] by olveyra

removed reference to guid (decobot specific) attribute in item pipeline ...

09/12/08:

18:55 Changeset [228:4ff78c39d1bd] by Daniel Grana <dangra@…>

mediamiddleware: bugfixes and remove prints

17:48 Changeset [227:e1f13a264f52] by olveyra

- Support for not output in cluster status - schedule option by default ...

16:58 Changeset [226:ef90a278007e] by Daniel Grana <dangra@…>

add media pipeline skeleton

09/11/08:

13:35 Changeset [225:8a4c9af757a4] by Andres Moreira <elkpichico@…>

Fixed condition of new option --quiet.

12:33 Changeset [224:91a28bac3a72] by olveyra

- leave in adaptors.py only the elemental adaptors pipeline code. - moved ...

11:59 Changeset [223:8110e2ab3e4d] by Andres Moreira <elkpichico@…>

Fixed bug in the total items count. Added new option --quiet to the action ...

11:50 Changeset [222:661fa930e1aa] by elpolilla

Small piece of code moved for not complying with conventions

09:13 Changeset [221:e7d9bc1c41d8] by elpolilla

some small semantics errors fixed in xpath test

08:14 Changeset [220:e933ec04f26d] by elpolilla

implemented extract_unquote method over xpath selectors

09/10/08:

11:06 Changeset [219:344b703d1a4b] by olveyra

- AdaptorPipe? compilation feature to resolve the problem of efficience and ...

09/09/08:

23:49 Changeset [218:cc0524c61580] by elpolilla

bugfix in decompressor tool

12:05 Changeset [217:4aad6de30ddb] by olveyra

Added getd command, similar to get but first decompress the ...

11:12 Changeset [216:a91709634c32] by elpolilla

pylinted decompressor

11:10 Changeset [215:a963dca75d67] by elpolilla

decompressor tool improved

11:10 Changeset [214:01c82b133da1] by anibal

Added more flexibility to childclasses who override gespider command

10:02 Changeset [213:8d99a8e1e0eb] by elpolilla

added sample data for decompression tool's test

09:56 Changeset [212:28e574f3afa3] by elpolilla

added response decompression tool

09/08/08:

09:06 Changeset [211:d548356aea33] by olveyra

minor code fix

09/06/08:

14:03 Changeset [210:3aa8466a6e75] by olveyra

Removed PRIORITY constants, added DEFAULT_PRIORITY setting

Note: See TracTimeline for information about the timeline view.