
May 06, 2007

The Problem For Google

I think Google and the other search engines have a problem.    I think the programs are dynamically generating too many web pages for Google to crawl and index.

Has anyone noticed all those news announcements about the new Google data centers? They are going to consume all the power of Niagara, or whatever the local dam is near where they are being built. They do not want power consumption figures to be in the public record. Does that bother anyone? Aren't computers supposed to be getting more efficient? Have you seen those graphs about the growth in size of the internet? I think that all of our little tiny servers are going to overwhelm Google.

I do not think that there should be that much data on the internet. Not that many servers are needed. Just look at my site. Aside from the resume, about 20 KB, I can define a person using about 100 bytes of data. Say there are 6 billion people registered. I can save all that data in 6*10**11 bytes, excluding resumes. That is 600 gigabytes. I can do that on 1 server! Maybe the resumes are more. A quick scan of my resumes shows they average 20 KB, so the resume data is another 120*10**12 bytes, or 120 terabytes. Maybe each person has a few things to sell, and some photos and some videos, so the size of the web could be several times that. Really not that big.
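Here is that back-of-the-envelope arithmetic as a small Python sketch. The constant names are mine, and I am assuming decimal units (1 KB = 1000 bytes), which is what the round numbers above imply.

```python
# Rough estimate of the "core data" for everyone on Earth.
# Assumption: decimal units, so 1 KB = 1000 bytes.
PEOPLE = 6_000_000_000        # 6 billion people registered
BYTES_PER_PERSON = 100        # about 100 bytes to define a person
BYTES_PER_RESUME = 20_000     # resumes average about 20 KB

profile_bytes = PEOPLE * BYTES_PER_PERSON   # 6*10**11 bytes
resume_bytes = PEOPLE * BYTES_PER_RESUME    # 120*10**12 bytes

print("profiles:", profile_bytes / 10**9, "gigabytes")
print("resumes: ", resume_bytes / 10**12, "terabytes")
```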

So why do they need all the servers? Because dynamically generated web pages are increasingly numerous, and are much, much larger than the core data. Google has an exploding problem to index just my site, let alone all the other ones. My 100 bytes of data becomes a web page of 24.55 KB.

It will soon get worse than that. Say I have each candidate draw a polygon where they are willing to work, then I can have 50 more job markets for the 50 states. Google would have to index 50 times more stuff. A little less than that, because not every candidate will work anywhere. But worse yet, say I start putting up job boards for each city in California, think 1000 cities: 1000 times more web pages that Google would have to index.
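To put rough numbers on the blow-up, here is a small Python sketch. The function names are hypothetical, and it makes two simplifying assumptions: 24.55 KB means 24,550 bytes, and every view renders one full page per record. That second assumption is an upper bound, since not every candidate shows up in every job market.

```python
# How much bigger is the crawlable web than the core data behind it?
# Assumptions: 24.55 KB = 24,550 bytes, and each view (state, city, ...)
# renders one full page per record. Real numbers would be somewhat lower.
RECORD_BYTES = 100     # core data for one person
PAGE_BYTES = 24_550    # the generated web page for that person

def amplification(page_bytes, record_bytes):
    """How many times larger the rendered page is than the core record."""
    return page_bytes / record_bytes

def crawl_bytes(records, views, page_bytes=PAGE_BYTES):
    """Upper bound on bytes a crawler must fetch for all views of the data."""
    return records * views * page_bytes

print("one page vs. one record:", amplification(PAGE_BYTES, RECORD_BYTES), "x")
for views in (1, 50, 1000):
    print(views, "views of one record:", crawl_bytes(1, views), "bytes to crawl")
```

The core data never grows in this sketch. Only the number of views does, and the crawl size grows right along with it.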

Eventually there will just be too many pages for Google to index. They could call me a web spammer, and ban me, but really I am not. I am putting up legitimate pages. I am putting up many different views of the same data. I am just overwhelming their servers.

I think that is what is happening. Google may be selling themselves as having these huge assets called server farms. But historically I have always treated my computers as depreciating assets, with a short half-life. It sounded like a weird sales pitch to me. Invest in us, because we own lots of depreciating computers. Weird.

I think Google has a problem.

Job Market Widget Changes

Well, still nobody has noticed my job market widgets. I need to create a job board for bloggers. That is my target market here. And then I need to have the display page be the list of jobs. That is more typical.
Any other recommendations? Do you like the blue color? Should I upgrade the webpage on my site? Should I make it possible to individually edit the features of the widget? HELP. I need some market feedback.

Widget Gallery Submission

I just submitted my widgets to the TypePad gallery. I guess they work on my blog. What else can I ask for?

Talk about rapid prototyping. I still have no real idea what I have just created. What will people think about it? Will you use it? How many users are there here anyhow?

Please, please tell me what you think about what I am doing. This is all new to me. I think this is interesting stuff, but what do I know? The question is what will the users think?

May 05, 2007

Evolution of Job Boards

Email Lists
    Before the web was born there were a number of email lists for jobs.  I ran one. 

Monolithic Job Boards
   Monster, HotJobs, Dice, Career Mosaic. History is littered with the remains of these monolithic job boards. One board does everything.

Niche Job Markets. 
    Job Coin and others offered the services of building a niche job board attached to any website.

Industry Specific Job Boards.
   These made more sense. One job board for each profession. Do a Google search on your favorite profession and its keywords, and you will find lots of these.

But none of these addressed the parallel nature of the internet. A job can be found on any job board. So what does one do?

Vertical Search in Job Boards.
   Indeed.com and simplyhired.com aggregate information from many job boards, and bring those jobs together.  Good idea.

What is next?

Job Market Widgets
    Let every web site host a widget or two for the job markets it is interested in.  That way end users stay on their favorite web sites, and also find the jobs in their industry. 

Job Market Widgets

I am about to release job market widgets onto the blogosphere. One Linux developer was telling me:

Whenever I do a search on any of the big job boards or vertical search engines for Linux jobs, I get lots of Windows jobs that mention Linux. It drives me nuts.

With Linux job market widgets you do not have that problem. These widgets are only posted on Linux-related blogs, and only receive Linux-related jobs and resumes. Check out linux.freerecruiting.com/Resumes/Widgets for our Linux widget. We offer 38 specialty job market widgets, and counting.
