Progress Update

Started by Wilson Logan, May 06, 2009, 11:53:07 PM

Wilson Logan

Hi All,

We are currently able to capture Message Number, Date & From fields, which leaves Yahoo ID, Subject, Message Body and Email address (where available).

This is a big step forward because it means we have working code; we just need to fill in the blanks, parser-wise.

Hopefully not too much longer now.

Cheers,

Wilson.

t2000kw


Wilson Logan

Sorry the news isn't more exciting.

Cheers,

Wilson.

MyNameIsScott

But still, proof of concept is huge.

Woo hoo!

Scott

Jeff Tundis

That's great news! It's all downhill now that the code is working! Thanks to you and your crew for all the hard work, Wilson.

Wilson Logan

Hi Jeff,

We are now tantalisingly close. The parsers work. The code works. But not together. One regular expression (which works fine in test) has decided not to work in the program. It's the regex for Yahoo ID:

(?:[\s\S]*?)href=\"http:\/\/profiles.yahoo.com\/([^\"]*)\"

My personal opinion is that it's the unpaired quotation mark at the end. But I know nothing of regexes or Boost 1_34_1, which I suspect is the real issue.

Anyway, if any of the foregoing makes sense to you, please drop me a line.

Cheers,

Wilson.

nightstitcher

re:
(?:[\s\S]*?)href=\"http:\/\/profiles.yahoo.com\/([^\"]*)\"

I think you're on the right track with the concern about unpaired quotation marks. I don't know regex, but I have worked in VB (I think you mentioned that's the language in use), and it can be picky about special characters, which would certainly include the quotation mark, especially if used in or with a string variable or constant. They may need to be doubled, replaced in some contexts with single quotes, sent as character codes (e.g. chr(34)), or handled with other workarounds. Try running your code in immediate/debug mode and see if there's a difference between the strings you're actually passing or generating and the strings you think your code should generate. I've driven myself nuts with this kind of thing. Also (this may not be at all relevant), if SQL or XML is involved, it can also choke on certain special characters, such as &, which may need to be quoted, doubled, or otherwise worked around.
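Since the program discussed in this thread is C++ rather than VB, here is a minimal sketch of the same quoting concern in that language. It is an illustration under assumptions, not a diagnosis of the actual bug: it uses std::regex (C++11) as a stand-in for Boost.Regex 1_34_1, it drops the backslashes before / and " from the posted pattern (the regex engine does not need them), and the names kEscaped, kRaw and extract_id are made up for this example. The point is that the compiler reads a string literal before the regex library ever sees it, so in an ordinary literal each backslash meant for the regex has to be doubled and each double quote escaped. If the pattern were pasted between quotes unchanged, sequences such as \s and \/ are not standard C++ escapes, and what reaches the regex library may differ from what was tested in a regex tool.

```cpp
#include <regex>
#include <string>

// Ordinary string literal: backslashes intended for the regex engine are
// doubled, and double quotes are escaped so the compiler does not end the
// literal early.
const std::string kEscaped =
    "(?:[\\s\\S]*?)href=\"http://profiles.yahoo.com/([^\"]*)\"";

// Raw string literal (C++11): everything between R"re( and )re" is taken
// verbatim. The custom delimiter "re" is needed because the pattern itself
// contains the sequence )" which would end a plain R"( ... )" literal.
const std::string kRaw =
    R"re((?:[\s\S]*?)href="http://profiles.yahoo.com/([^"]*)")re";

// Hypothetical helper: returns the first capture group (the Yahoo ID),
// or an empty string when the pattern does not match.
std::string extract_id(const std::string& pattern, const std::string& html) {
    const std::regex re(pattern);
    std::smatch m;
    return std::regex_search(html, m, re) ? m[1].str() : std::string();
}
```

Both constants hold the same characters, so printing the pattern string at run time and comparing it with the text used in the regex tool is a quick way to check whether the compiler changed anything. Raw literals postdate VS2005; the doubled-backslash form is the one that works on older compilers.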


micromusic

that regular expression makes no sense (to me). what are you trying to find as target, and from what source line? you mentioned boost, so the regex syntax would be Perl regexes?

guess: you want the "__this__" part as target from something like

  <a href="http://de.profiles.yahoo.com/__this__">blablabla</a> </span>


it would be helpful if you could post a few lines of code showing what you are trying to attempt.
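To make the guess above concrete, here is a small sketch that runs the pattern against that sample line. Assumptions: std::regex (C++11) stands in for Boost.Regex, the backslashes before / and " are dropped from the posted pattern, and kPosted, kWithSubdomain and first_capture are hypothetical names. One thing the sketch shows is that the pattern as posted requires "profiles" directly after "http://", so it finds nothing in a link that goes through a country subdomain such as de.profiles.yahoo.com. Whether the real pages use such subdomains is not known from this thread.

```cpp
#include <regex>
#include <string>

// Pattern as posted, minus the redundant backslashes before '/' and '"'.
// The dots are left unescaped, as in the original, so each matches any
// single character.
const std::string kPosted =
    R"re((?:[\s\S]*?)href="http://profiles.yahoo.com/([^"]*)")re";

// Hypothetical variant: allows zero or more subdomain labels (e.g. "de.")
// in front of profiles.yahoo.com and escapes the literal dots. The leading
// lazy prefix is dropped because regex_search already scans forward.
const std::string kWithSubdomain =
    R"re(href="http://(?:[a-z0-9]+\.)*profiles\.yahoo\.com/([^"]*)")re";

// Returns the first capture group, or an empty string when nothing matches.
std::string first_capture(const std::string& pattern, const std::string& text) {
    const std::regex re(pattern);
    std::smatch m;
    return std::regex_search(text, m, re) ? m[1].str() : std::string();
}
```

Called on the sample line above, first_capture with kPosted returns an empty string, while kWithSubdomain returns the "__this__" part. Both patterns capture the ID from a plain http://profiles.yahoo.com/ link.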

regards, uwe
select * from `mankind` where `brains`=='true'

Jeff Tundis

Sorry, Wilson. WAAYYYY over my head. Just trying to give you a little moral support :)

Wouldn't it be funny if it was two single quotes ' ' as opposed to a double "  ::)

I did find this little tool online. Don't know if it'll help, but what the heck:

M2 RegX 1.1
Regular Expression test tool, interactively learn regular expressions
This program is free to download and use. It is designed to help the end user interactively learn Perl Compatible Regular Expressions. This tool is designed to help RSM end users design User Defined Quality Notices for RSM Version 6.60+.

Scroll down to where it says "Download Link 1"

www.topshareware.com/M2-RegX-transfer-17754.htm

Wilson Logan

Hi Jeff,

I'm using a great tool called RegexBuddy.

I hacked the final \" off the end and it doesn't make any difference in RegexBuddy. I can't test it yet in the program as I have to install VS2005.

I'm a bit disappointed by the people helping me. Urgency does not appear to be their strong suit.

I will attempt to install VS2005 tomorrow and although I don't know C++ I can have a bit of a stab at testing. Some testing is always better than no testing.

Cheers,

Wilson.

Denos

Wilson,

Like others, I just want to thank you for the struggle you are engaged in on our behalf. I've read your recent messages with complete fascination, hardly understanding a single word. What an awful shame Yahoo is not able to share their secrets with you. After all, if anything, you increase their traffic, not decrease it. Dennis aka Denos aka Osnacantab
