I know what you're thinking: two "Boaty McBoatface" articles within two weeks? And we're past April Fool's Day?
But since I posted my original analysis about the "Name our ship" phenomenon that's happening in the UK right now, a new contender has appeared: Poppy-Mai.
The cause of Poppy-Mai, a critically ill infant who has captured the imagination of many British citizens (and indeed, of the world), has made a very large dent in the lead that Boaty McBoatface holds.
Yes, "Boaty" still has a-better-than 4:1 lead. But that's a lot closer than the 10:1 lead (over "Henry Worsley") from just over a week ago. Check out the box plot now: you can actually make out a few more dots. Voting is open for another 10 days -- and as we have seen, a lot can happen in that time.
As I take this second look at the submissions (now almost 6300) and voting data (almost 350,000 votes cast), I've found a few more entries that made me chuckle. Some of them struck me by their word play, and others cater to my nerdy sensibilities. Here they are (capitalization retained):
- Slartibartfast
- DO NOT LICK THIS SHIP
- I DO BELIEVE IN FERRIES
- A View to a Krill
- COLD FINGER
- WHATCHA TALKIN BOAT WILLIS
While I'm on this topic, I want to give a shout-out to regex101, the online regular expression tester. I was able to develop and test my regular expressions before dropping them into a PRXPARSE function call. I found that I had to adjust my regular expression to cast a wider net for valid titles from the names submissions data. Previously, I wasn't capturing all of the punctuation. While that's probably because I didn't expect punctuation to be part of a ship's name, that assumption doesn't stop people from suggesting and voting on such names. My new regex match:
title_regex = prxparse("/\'title\':\s?\""([a-zA-Z0-9\'\.\-\_\#\s\$\%\&\(\)\@\!]+)/"); |
I could probably optimize by specifying an exception pattern instead of an inclusion pattern...but this isn't the sort of project where I worry about that.
Will I write about Boaty McBoatface again? What will my next Boaty article reveal? Stay tuned!
8 Comments
Thanks for sharing a sequel to the Boatyard McBoatface story... I'm definitely hoping you'll write another post within the next ten days and maybe at the end of the race too. Those names are hilarious! Had me giggling...
Good tip with the online regular expression tester. Can imagine it would make it easier (and quicker) to check the tricky prxparse argument. BTW I don't see a closing double quote and bracket.
Thanks Michelle - this particular regex is driving our blog platform crazy -- probably lots of important code-injection logic getting triggered!
I've always used the regex tester at http://www.regxlib.com which allows you to load a local file or url into the data source. It also has a huge library of regular expressions people have submitted over the years so there's often one at least close to the one you want.
good tip - Thanks!
regex101 is a great companion to prx*. One nice feature is that you can download the entire website and run it off your local machine.
I can imagine! Was impressed to see it on the post as I was thinking about that. Reminds me of Bobby Tables :-) http://bobby-tables.com/
You might find entertainment in this person's name -- which means Nothing to me.
You're right... very entertaining.