I opened both libraries (from the ftpsearch plugin's parser folder).
1. Archie
None of the provided (commented-out) Archie hosts work (they come from RCData resource 1253).
I'll check later whether I can find a working one with Google and, if so, provide a working plugin to others.
2. Filesearch
The problem with the filesearch engine can be solved by adding a Referer header to the request, i.e.:
Referer: http://www.filesearching.com
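For anyone who wants to try this outside the plugin, here is a minimal Python sketch of a request that carries that header. The `/cgi-bin/s` path and the `q`, `l`, `e` parameters are my assumption, borrowed from the links that pattern 1152 matches; `build_search_request` and the query `example.zip` are made up for illustration. Nothing is sent until you pass the request to `urllib.request.urlopen`.

```python
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "http://www.filesearching.com"


def build_search_request(query: str) -> Request:
    """Build (but do not send) a search request with the Referer header set."""
    # Assumption: endpoint and parameter names are guessed from pattern 1152.
    params = urlencode({"q": query, "l": "en", "e": "on"})
    return Request(f"{BASE_URL}/cgi-bin/s?{params}", headers={"Referer": BASE_URL})


req = build_search_request("example.zip")
print(req.full_url)
print(req.get_header("Referer"))
```

To actually fetch the page you would call `urllib.request.urlopen(req)`; I have not checked what the site currently returns.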
This is my deduction of the regex scheme for the filesearch engine, though I doubt anyone will find it useful:
RCData resource: regex pattern
------------------------------------------------------------------------------------------------------
1150: ^ *?(\d+?) *?<img src=/img/(.+?)\.gif width=16 height=16> (?:<b> *?(\d*?)</b>| *?<(.*)?>) ...
        (\d+?)                     idx
        (.+?)                      icon name
        <b> *?(\d*?)</b>           file size
        <(.*)?>                    tag
      ... (\d\d)/(\d\d)/(\d\d\d\d) <a href=.*? class=ls>(.+?)</a>(?:<a href=.*? class=lg>(.*?)/?</a> ...
        (\d\d)/(\d\d)/(\d\d\d\d)   obvious date (not present)
        class=ls>(.+?)             host name
        class=lg>(.*?)             path name
      ... <a href=.*? class=lf>(.+?)</a>|<a href=.*? class=lg>(.*?)/?<b>(.+?)</b>/?</a>)$
        class=lf>(.+?)             file name
        class=lg>(.*?)             path name
        <b>(.+?)</b>               file name
------------------------------------------------------------------------------------------------------
1151: <a href=(.+?)><img src=/img/moreengl\.gif width=\d* height=\d* border=0></a>
        (.+?)                      link to next page
------------------------------------------------------------------------------------------------------
1152: ^ *?(\d+?) *?<a href=/cgi-bin/s\?q=.*?&l=en&e=on>(.+?)</a>$
        not clear as I can't find similar match
------------------------------------------------------------------------------------------------------
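To show how little it takes to break this, here is a small Python sketch that applies pattern 1151 to a result line. The sample HTML is hypothetical (I wrote it to fit the pattern, and the `p=2` parameter is invented), and `next_page_link` is just a helper name I picked. The second call changes a single attribute to a quoted form and the pattern no longer matches.

```python
import re

# Pattern 1151 as listed above: captures the link to the next result page.
NEXT_PAGE = re.compile(
    r"<a href=(.+?)><img src=/img/moreengl\.gif width=\d* height=\d* border=0></a>"
)


def next_page_link(html):
    """Return the captured href of the 'more' link, or None if nothing matches."""
    match = NEXT_PAGE.search(html)
    return match.group(1) if match else None


# Hypothetical sample line; the real page markup may differ.
sample = ("<a href=/cgi-bin/s?q=test&l=en&e=on&p=2>"
          "<img src=/img/moreengl.gif width=80 height=20 border=0></a>")

# Same line after a trivial markup change on the site's side.
changed = sample.replace("border=0", 'border="0"')

print(next_page_link(sample))
print(next_page_link(changed))
```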
This scheme doesn't work any more either: any change on the host provider's side breaks such a fragile pattern, and that is exactly what has happened.
Things like this should be provided to the end user in an open format, such as XML. Closing it off is always a bad idea.
Now the plugin can't be easily extended and, more importantly, issues can't be repaired (until the next version, and then the next minor change on the third party's end).
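To make the XML idea concrete, here is a rough Python sketch of what user-editable patterns could look like. The XML layout (the `engine` and `pattern` elements, the `id` and `role` attributes) is entirely my invention, not anything the plugin supports; `load_patterns` is a hypothetical helper. The point is only that a broken pattern could then be fixed in a text editor instead of waiting for a new build.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical file layout; CDATA keeps the regex free of XML escaping.
PATTERNS_XML = r"""
<engine name="filesearch">
  <pattern id="1151" role="next-page"><![CDATA[<a href=(.+?)><img src=/img/moreengl\.gif width=\d* height=\d* border=0></a>]]></pattern>
</engine>
"""


def load_patterns(xml_text):
    """Compile every <pattern> element into a regex, keyed by its id attribute."""
    root = ET.fromstring(xml_text)
    return {p.get("id"): re.compile(p.text) for p in root.iter("pattern")}


patterns = load_patterns(PATTERNS_XML)
print(sorted(patterns))
```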