[nycphp-talk] Search for NYPHP List Archives available (alpha ver)

Jayesh Sheth jayeshsh at ceruleansky.com
Thu Apr 22 01:16:51 EDT 2004


Hello all,

I have been wanting to search through the treasure trove of information 
contained in the NYPHP archives from 2002 to date.
Thankfully, the guys at nyphp.org made the whole archive available as a 
(Unix mail) mbox file.

I used a utility from Fookes Software (fookes.com) called Mailbag to 
extract the messages into one big CSV file. (Originally I thought I would 
use Mailbag's built-in search feature, but that proved too cumbersome. 
Plus, I wanted to customize the output of the search.)

I then imported that file into a MySQL database table, and enabled 
MySQL fulltext search for that table. (I had to do a manual search and 
replace with another Fookes program, NoteTab, to get certain characters 
escaped, e.g. quotes and dollar signs.)
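
One way to avoid escaping characters by hand is to load the rows through 
placeholders, so the database driver handles quotes and dollar signs 
itself. The sketch below uses Python's built-in sqlite3 module purely as 
a stand-in for MySQL so that it runs anywhere; the table name messages, 
its columns, and the function load_rows are all assumptions, and it does 
not set up a fulltext index.

```python
import csv
import io
import sqlite3


def load_rows(conn, csv_text):
    """Insert CSV rows using placeholders; return the number of rows inserted."""
    # Hypothetical table layout for this sketch.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS messages (sender TEXT, subject TEXT, body TEXT)"
    )
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = [(r["sender"], r["subject"], r["body"]) for r in reader]
    # The ? placeholders pass values separately from the SQL text,
    # so quotes and dollar signs need no manual escaping.
    conn.executemany(
        "INSERT INTO messages (sender, subject, body) VALUES (?, ?, ?)", rows
    )
    conn.commit()
    return len(rows)
```

With a MySQL driver the idea is the same, although the placeholder 
syntax is typically %s rather than ?.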

I then added on a PHP search interface to it.

With a lot of luck I seem to have got a basic version working. Having 
this large set of information publicly searchable will no doubt be 
useful to others.

For example:
http://www.ceruleansky.com/nyphp_mail/index.php?q=xml-rpc

I am not sure it works perfectly yet, since it is just a day or two's 
work. (I built on some functions I had written before, so it went 
relatively fast.) It currently returns up to 25 matches per search. 
There is no built-in support for displaying messages by thread, or for 
fine-grained searching (by field or with boolean expressions).
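
To illustrate the kind of query such a search could run, here is a 
minimal sketch that builds a MySQL fulltext query capped at 25 rows. It 
is written in Python for illustration rather than the PHP used for the 
actual interface, and the table name messages, the column names, and the 
function build_search_query are assumptions, not the real schema.

```python
MAX_RESULTS = 25  # matches the cap described above


def build_search_query(term, limit=MAX_RESULTS):
    """Return (sql, params) for a natural-language fulltext search."""
    term = term.strip()
    if not term:
        raise ValueError("empty search term")
    # Hypothetical table and columns; a FULLTEXT index on
    # (subject, body) is assumed to exist.
    sql = (
        "SELECT sender, sent_at, subject FROM messages "
        "WHERE MATCH (subject, body) AGAINST (%s) "
        f"LIMIT {int(limit)}"
    )
    # The search term is passed as a parameter, never pasted into the SQL.
    return sql, (term,)
```

A call such as build_search_query("xml-rpc") returns the SQL text plus a 
one-element parameter tuple, ready to hand to a driver's execute() call.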

Please try out this alpha version and let me know if it works and if it 
is of use.

Best Regards,
- Jay Sheth



