Ryan Hitchman
|
c2a607198c
|
PEP8
|
2010-08-29 22:35:27 -05:00 |
melonhead
|
93f626c482
|
Removed debug print lines from URL normalizer
|
2010-07-14 16:48:18 -04:00 |
melonhead
|
61c42e7d8a
|
urlhistory: main regex no longer matches 'http://' or 'www.'
urlhistory: added URL normalization for Amazon, Waffleimages, and Youtube
|
2010-07-14 16:45:26 -04:00 |
Ryan Hitchman
|
627b83039c
|
clean validate, pep8, remove CRs
|
2010-03-12 23:16:06 -07:00 |
Ryan Hitchman
|
ee8d51dc62
|
remove cruft from urlnorm
|
2010-03-03 20:25:13 -07:00 |
melonhead
|
bb709a74bf
|
Added URL normalization to urlhistory module to allow better detection of duplicates
Added configurable ignored URLs to urlhistory module
|
2010-01-18 15:07:06 -05:00 |