elz: (Default)
elz ([personal profile] elz) wrote2010-04-28 11:23 pm
Entry tags:

Too braindrained for subject lines

-I'd avoided working on it for *cough* two-and-a-half years, but I'm finally poking at the AO3 html parser. I've thus far succeeded in triggering a number of infinite loops, rewriting it to be 10 times slower, and causing bits of text to come out in a completely jumbled order (which makes for some interestingly surreal reading). If you need me for anything, I'll be off in the land of nodes, deep in the forests of recursion. Wish me luck.

-Finally watched The Time of Angels. Very creepy, Amy was awesome, and the Doctor was great. She saved herself! He bit her hand! Not that keen on River Song, but I'm curious to see if we learn more about her next week - I hope so.

-Have also been re-reading Agatha Christie every night before bed. I'd forgotten the extent to which the early Hercule Poirot novels are all about Hastings'... lack of perspicacity, shall we say? Really, the entire point of both The Mysterious Affair at Styles and The Murder on the Links is that he's entirely clueless and instantly infatuated with every pretty girl who walks by, and the plot hinges on it on multiple occasions. I'm halfway through The Big Four (not her best work), and age and experience only seem to have tempered those qualities of Hastings' to some extent. (I had also forgotten that he was fairly young when he first appeared, although it's possible I had a different scale for relative age when I was 12.)

"Unlikely friendship" is kind of an understatement for those two, isn't it?
samvara: Photo of Modesty Blaise with text "All this and brains as well" (Default)

[personal profile] samvara 2010-04-29 04:40 am (UTC)(link)
*loves on your brain*

I loved how cool Amy is and I loved it when he bit her hand!
flamebyrd: (Default)

[personal profile] flamebyrd 2010-04-29 06:17 am (UTC)(link)
I tried to reread The Mysterious Affair at Styles and couldn't stand it for Hastings' attitude towards Poirot. He is quite consistently doubting in most of his appearances, I find it quite bothersome.

I've been listening to the Christie BBC Radio plays on my way to/from work, it's been fun.

PS: Let me know if you want to bounce ideas for the HTML parser? I know I kind of messed it up before, but I did learn a lot about the issues involved and can probably at least discuss it.
zooey_glass: (SPN: Dean - happy beer)

[personal profile] zooey_glass 2010-04-29 08:33 am (UTC)(link)
Dude, you totally didn't mess the parser up! You did an awesome job - in fact we wound up reverting to yours after deploying the latest new version (superfast! But tragically flawed!). I think the parser is basically a beast - Elz is hoping to try and switch a lot of the heavy lifting over to nokogiri so that slightly less of the beastliness is handled by our code *g*
flamebyrd: (Default)

[personal profile] flamebyrd 2010-04-29 02:18 pm (UTC)(link)
I tried something with the precursor to Nokogiri for a while, but eventually decided it was taking me too long and that to actually fix the bug I had originally set out to fix the regexes would do.

I had exactly the same problems you're discovering. I think finally I was toying with dumping it into the parser with no modification, and then iterating through it node by node and creating a new document, adding paragraphs and newlines where appropriate. That required jumping into recursion, though, which didn't thrill me, and I had a lot of trouble with text nodes vs. elements.

Originally I was putting multiple
tags between inline elements, but I wonder if they look different to our paragraph linebreaks with our stylesheet? Maybe it would be better to add a style attribute to any block elements inside them, or maybe add a surrounding div.

*ponders*
If we're on Ruby 1.9 (I think that was the number) there's a new regex engine we could use that would allow lookaheads, thus allowing us not to have to read in entire blocks of tags, but it probably wouldn't be any faster. XD
zooey_glass: (SPN: Dean - 'Awesome!')

[personal profile] zooey_glass 2010-04-29 08:29 am (UTC)(link)
Yeay Elz! *cheers you on* You can defeat the parser!

I really liked 'Time of Angels' too. The angels are the creepiest monster ever, I am always pleasurably freaked out by the episodes with them in. I've liked River Song in the past and was pleased when she turned up again, although I thought they laid the all-knowing wife thing on a bit too thick this time.
enigel: David Tennant looking like he's doing an angry Weevil impression (DW Tennant manic)

[personal profile] enigel 2010-04-29 01:26 pm (UTC)(link)
I am so, so hoping it won't be something as simple as the perfect wife. She's never actually said they'll be boinking, has she? Just that she knows his real name, they will have had many adventures yadda-yadda. Because, well, why did she leave him or he her then?

The angel pulling a Samara and escaping out of the television - yeah, thanks, I was creeped out FOREVER by The Ring, really needed a repeat. I watched from between my fingers, sort of.

And for Elz: good luck, don't get eaten by a grue! *gives tiny code fairy to light the way, and a machete for the thicker parts of the code jungle*
cesy: "Cesy" - An old-fashioned quill and ink (Default)

[personal profile] cesy 2010-04-29 10:35 am (UTC)(link)
*cheers you on against the parser*