Spiders and crawlers

Dave Cramer davec-zxk95TxsVYDyHADnj0MGvQC/G2K4zDHf at public.gmane.org
Mon Apr 5 16:53:04 UTC 2010


On Mon, Apr 5, 2010 at 12:41 PM, Lennart Sorensen
<lsorense-1wCw9BSqJbv44Nm34jS7GywD8/FfD2ys at public.gmane.org> wrote:
> On Thu, Apr 01, 2010 at 05:56:35PM -0400, Evan Leibovitch wrote:
>> I'm looking to implement a spidering system intended to look through a bunch
>> of catalog websites, in order to track changes to those catalogs (with the
>> help of a backend MySQL system).
>
> I always wonder: Why mysql?  Postgresql is an obviously better and more
> scalable choice.  Why do so many people just barge ahead with mysql?

I've always wondered that myself. My first thought is that it isn't that
obvious to the people who barge ahead with mysql.

Have you tried searching for 'java web crawler'?

--dc--
--
The Toronto Linux Users Group.      Meetings: http://gtalug.org/
TLUG requests: Linux topics, No HTML, wrap text below 80 columns
How to UNSUBSCRIBE: http://gtalug.org/wiki/Mailing_lists




