There are so many weird and wonderful things on the big search services, you need cheat sheets to keep track of the specialized types of search that they provide. The Yahoo! Shortcuts page has a bunch o' tricks for searching Yahoo! The Google Cheat Sheet has coverage on the search operators and parameters that can be fed to their query systems. Well, we don't have crib notes or hacks books about us (yet) at Technorati, but we're working on being that cool, too <g>
( Apr 28 2006, 10:51:05 PM PDT ) PermalinkThere are lots of ways to ping Technorati. Your blogging platform may do it already. You could use a slick editing tool like ecto that will do it for you. You can even roll it yourself in c-sharp. But however you do it, it's important that you let Technorati include you in the distributed conversation by notifying that you've posted.
Recently, there have been some problems with Ping-o-Matic that I worked with Matt to unravel. If you use a ping relaying service and are having difficulty getting indexed by Technorati, please ping directly! Of course Technorati will continue working with ping relayers such as Ping-o-Matic, Pingoat and so forth; they're providing a valuable service to the blogging communities that we are grateful for. However, when in doubt take the direct route!
More soon to come on the Technorati Weblog!
technorati ping pingomatic pingoat ecto blogging
( Apr 27 2006, 11:27:31 PM PDT ) PermalinkSince Niall mentioned GData, I've been meaning to look into it further. Today Otis mentioned that one of the Apache/Summer of Code projects proposed is a lucene-based GData server implementation.
I took a look at the docs and realized that this is actually a really old spec, as old as the epoch as a matter of fact. Check it out:
But seriously folks, the G-man and his crue have done a fine job providing client implementations (as long as your not waiting on Ruby or one of the P-languages, no perl, python or php yet). Even a nice set of examples for the Java implementation. Thanks, G!
google gdata lucene summerofcode ruby perl python php
( Apr 25 2006, 08:17:45 PM PDT ) Permalink
Over the last year and a half, I've spoken extensively to friends, colleagues and audiences about web spam. At eTech I showed how spam blogs behave statistically atypical, as soon as you start looking at the publishing characteristics (such as linking and posting rates), the spam comes percolating up to the top. For instance, this chart is a sample of linked-to domains from blogs hosted by Google's blogspot service (the y-axis is in thousands of links per day):
The highlighted domains are sites that the spammers are trying to put in front of mouse cursors by making them look important. Besides being a nuisance, this is part of a larger hazard to the whole web advertising market.
While Technology Evangelist pointed out Google News' role in the spam ecosystem, Niall has previously noted that Google provides lots of tools for perpetrators of web spam to employ. AdSense is the prime object and subject of spam. A video posted yesterday on YouTube details the anatomy of some typical AdSense abuse:
There has been a recent explosion in abuse of AdWords, Google's PPC (pay per click) advertising platform. It is apparent that the techniques do not follow Google's own clear guidelines. Unfortunately, users and legitimate advertisers pay the price, while Google and the unscrupulous advertisers profit.Web spam isn't new, this has been going on for years and at this point, one must conclude that Google's not serious about doing anything about their spam problems. There used to be a qualification step for AdSense and they'd reject sites that didn't meet some basic criteria as legitimate sources of content. But that stopped about a year and a half ago and the spam has been pouring on ever since. The video is a call to action to complain to Google; they have a fidicuary responsibility to move against the abuse. The video shows a search for forklifts to illustrate rampant ad policy violations. The video narration asks, "Why is google allowing this to happen?" and answers
Google has always put its users first and one would hope that they will continue to abide by their charter. My aim with this video is to urge them to do something about these issues sooner rather than later.
This is the dirty little secret about these tactics: Google profits from every click on it's network. Be it on ads that are clicked on google.com or ads that are clicked on websites that are running google ads throuth the adsense program. This is a very difficult thing for them to self police because doing something about it will effect their short term profits.I'm not condemning AdSense per se, it's a great service and revitalized web advertising after the flame-out a few years ago. But black-hat SEO's have definitely cranked up the game over the last year or so and are putting the whole market at risk again. Google may not be particularly motivated to go out and find the abuse but they have to act against it when you bring it to their attention. Watch this video:
web spam splog google spam adsense adwords splogs
( Apr 25 2006, 12:28:25 PM PDT ) PermalinkA few months ago, I mused that we should be able to abandon FastCGI (with extreme prejudice) and use AJP13 with Ruby on Rails instead. Well, unbeknownst to me at the time, someone was hatching just such a plot, the Ruby/AJP Project! I'd heard last month that David Andersen was tinkering on installing it... well, he not only got it online but he blogged how he did it. Take a look at his compile time and run time configuration details using Apache 2.2's native AJP13 protocol plugin for mod_proxy (i.e. no mod_jk, good riddance), it's really cool! Way to go, David!
rubyonrails ruby ajp13 apache mod_proxy mod_jk
( Apr 24 2006, 08:48:30 PM PDT ) PermalinkJust for giggles, I fiddled around with the Google Maps API on the Happy Valley Odyssey of the Mind blog by putting up a map of Ames, Iowa, where the Odyssey of the Mind World Finals take place.
I tried setting polylines to outline the perimeter of the Iowa State University campus but it seems like the Google Maps API is pretty brittle; if you get it wrong, there's no debugging apparent (I checked the Javascript console), you just get a blank map. I'll have to poke at it a bit more some other day. Doing the basic stuff is easy though, check out How to add a Google Map to any web page.
Be sure to stop by the Happy Valley Odyssey of the Mind blog and make a donation, it's a great cause!
googlemaps odysseyofthemind ames iowa
( Apr 23 2006, 07:36:08 PM PDT ) PermalinkIf there's anything to be said for the innovations in the tools of creation and distribution of our present day, it's that contemporary political humor has gotten so much funnier!
Thanks, Adam!Sitting on my own brain, waiting for the end of days
Corporation profits, Bloody oil money
I'm above the law and I'll decide what's right or wrongI am the egg head, I'm the Commander, I'm the Decider
(check it out)
Koo-Koo-Kachoo
humor political humor bush beatles parody decider
( Apr 22 2006, 05:26:39 PM PDT ) PermalinkThe Java backlash that began a few years ago was mostly a J2EE backlash, not against the Java language per se. Too many people took the blueprints too seriously, too literally or just too damned religiously. Too many applications that didn't need EJBs were using them, letting the container manage low level application plumbing invited slow and buggy behaviors that were painful to debug. The backlash has made a lot Perl/Python/PHP enthusiasts express self-righteous vindication and have helped morph the J2EE backlash into a broader Java backlash. Geez, even IBM is getting all spun up on PHP, whodathunk? But I think the dismissal of Java is premature. None of the P languages or Java are without hazards. These days a lot of developers are over the blueprint kool-aid and are standardizing on a simplified and productive stack:
To really bring rapid development and prototyping to a Java environment, there a lot options to look at such as dynamic JVM languages:
I expect in the months ahead to be writing applications with plugin support and that the big win for the dynamic JVM languages for me will be in easing the rapid development of plugins. In other words, I probably wouldn't write an end to end application with them but given a set of interfaces for extension points that can be automatically tested, writing the extensions in JRuby or Groovy sounds compelling.
I actually haven't had time and opportunity to substantially try half the things I've mentioned thus far. Surveying the number of tools, languages and frameworks it's clear that there are a lot of things to consider and that a lot people are concerned with (and working hard on) bringing the down the high ceremony of Java. I'll still be using P languages in the future, too. Down the road, I suspect virtual machines (JVM? parrot? mono/CLR?) will make a lot of these issues fade away and the questions at hand will be around when to use closures and when to use objects, when to annotate and when to externally declare, when to explicitly type or auto-type and so forth. The languages will be incidental as they support shared constructs and virtual machines.
java rubyonrails groovy jruby jython maven eclipse xdoclet springframework programming
( Apr 21 2006, 09:03:42 PM PDT ) PermalinkI've been wondering how Lego will maintain a business around Mindstorms and at last, I think we have an answer: they'll hop on ye olde cluetrain. By enabling the community of Mindstorms enthusiasts to drive innovation openly, I finally feel confident that the Mindstorms technology will enjoy long term viability. From the Gizmodo post:
Jon Lund took some time out from liveblogging the CustomerMade conference in Copenhagen to email in and tell us that according to Soren Lund of Lego, the software behind the upcoming highly anticipated Mindstorms NXT will be published as open source; Lego is currently in the last stage, figuring out which public domain license to use before releasing it. Power to the people! (read on)The dreaded EOL'ing scenario, such as that suffered by the Sony Aibo, would have been a really crappy outcome for Mindstorms. Instead, they're innovating and opening up. Thanks, Lego! Oh, and one hting: BSD/Apache style licenses, please!
lego mindstorms opensource sony aibo
( Apr 20 2006, 03:25:53 PM PDT ) PermalinkI suspect I'll be opting to casual carpool more often with BART eliminating free parking at the Contra Costa county stations I frequent. That could raise my already-not-inconsequential commute costs 15%. And how timely. Gas is already exceeding $3/gallon and the chatter on the radio is to expect $4/gallon! In that scenario, I wouldn't be surprised to see carpool drivers putting a cup out for the riders as their fuel prices put them in the squeeze. Meanwhile, the Big Oil Companies are ringing in record profits...
One possible ray of light for the BART ride option is the report of WiMax service coming:
WiMAX is similar to WiFi but can carry signals across greater distances. WiMax is also being considered by Silicon Valley public transportation officials (free registration) who want to let passengers browse the Internet on local train systems like BART. They want to run a test from July to December. WiMAX, they believe, might be a better technology to do hand-offs as the train rushes through various wireless coverage zones. read on(via burtonator)
Maybe the price hikes will help them pay for a software test harness; BART's bugs have rendered the system unusable in the past.
( Apr 20 2006, 07:19:57 AM PDT ) PermalinkThis is totally amazing! With all of the gazillions of dollars and BTU's of hot air poured out over "homeland security," here comes Marc Ecko laughing in the face of the beast by tagging (as in, the spray paint kind, not folksonomy) Air Force One!
Coming next: "Mark Ecko In Gitmo"
airforceone graffiti homelandsecurity humor tagging
( Apr 19 2006, 09:29:21 AM PDT ) PermalinkHere comes a changing of the guard at the War House: McClellan Out as White House Press Secretary. Now I just wonder what the final parting words from Dubya to exiting Press Secretary Scott McClellan will be, let's see:
politics bush scottmcclellan whitehouse
( Apr 19 2006, 06:43:50 AM PDT ) PermalinkI posted last month about how winning feels good. With the thrill of victory comes a new challenge: what's next? Well, my daughter's team's second place showing in the Odyssey of the Mind regionals was followed up by first place in the State competition, so next up: the World!
The World competition is in Ames, Iowa. I've never been there. I've flown over Iowa plenty of times, traveling to and from Chicago. But the corn fields, cattle corrals and pig pokes of Iowa ... will be a new to me. All told, we're running up thousands of dollars to pull this off but I'm sure for the kids this will be one of life's great experiences, so it's all worth it. I have an alter-ego running a separate blog to track that endeavor and our challenges. We've got paypal links to accept donations (tax deductible, even) but simply talking about and linking to that blog will help, so please shine a little light on us.
( Apr 18 2006, 05:20:20 PM PDT ) PermalinkToday, like many days, the phrase "user generated content" left my lips in the course of conversation. It's a habit. OK, maybe it's a bad habit. Since Tim Bray posted about his hatred for the label, I've been increasingly self-conscious about using those words. I agree, it's laden with exploitative connotations. Derek Powazek adeptly decomposed the nastiness further. Yes, not long ago editorial, movie editing, audio mixing and other tools of creation were only accessible to the pros. Yes, the burst of creativity that has accompanied the mass-amateurization of media of all kinds begs for an improvement of the vernacular. However, Scott Rosenberg, lamenting the absence of a credible replacement, reminds us that content from the pro's still has value (Seymour Hersh didn't blog the latest plan of attack, now did he?). Breaking habits often requires conscious adoption of an alternative. So, what? People Contributed Media? Individual Creations? Actually, I'm more intrigued by "user distributed content" but maybe I'll post about that later and then I'll have to wring my hands over a better name for it.
( Apr 11 2006, 09:57:02 PM PDT ) PermalinkI've always been fond of public libraries, they can be a great resource and it's so... non-web (2.0 or 1.0). I've used 'em for youth sports coaching materials, current non-fiction, jazz CDs and such. They can also be real funny; sometimes they have contemporary titles but other times there's no hope in getting what you want without going out to the bookstore to buy it. I did a search for a title, "What Would Buddha Do at Work" on Contra Costa Library catalog and the first hit was for What would Buffy do? : the vampire slayer as spiritual guide. Buh!
( Apr 11 2006, 10:52:37 AM PDT ) Permalink