Free Data Sources For Madlib Sites

Posted 1020 days ago - Free Stuff

So you've read Blue Hat SEO's awesome write-up detailing how to use Madlib Sites to your advantage. Now what? Over the course of a few posts, I'll provide some additional information to complement Blue Hat SEO's post. This post lists free data sources to choose from when building your Madlib Sites.

Government Data

Government data sources are great simply because the content is usually accurate, concise, and abundant. For starters, bulk.resource.org makes the following available via http, rsync, BitTorrent, and ftp:

    • Commerce Business Daily
    • Public Safety Codes
    • Common Utilities
    • U.S. Copyright Database
    • The Judiciary
    • SEC EDGAR Database
    • Government Accountability Office
    • Government Printing Office
    • Internal Revenue Service
    • National Technical Information Service
    • Patent Full Text Database
    • Robotic Guidance
    • Smithsonian Institution
    • Trademark Database

Wikidumps

Anyone looking to build a Madlib site probably already knows that Wikipedia releases its entire database to the public as dumps. Grab them here: http://meta.wikimedia.org/wiki/Data_dumps
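The dumps are large XML files, so it helps to read them one page at a time instead of loading everything into memory. Here's a rough sketch of that idea in Python using only the standard library. The names iter_pages, local_name, and SAMPLE are made up for illustration, the sample XML is a tiny stand-in for a real dump, and the namespace version in it is a placeholder, so check the layout of your actual file first.

```python
import io
import xml.etree.ElementTree as ET

# Tiny stand-in for a real dump (assumption: real files follow the same
# <page>/<title>/<revision>/<text> layout, possibly with a different
# namespace version).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example</title>
    <revision><text>Some wikitext.</text></revision>
  </page>
  <page>
    <title>Empty page</title>
    <revision><text></text></revision>
  </page>
</mediawiki>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(xml_file):
    """Yield (title, text) for each <page>, streaming through the file."""
    title, text = None, ""
    for _event, elem in ET.iterparse(xml_file, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, ""
            elem.clear()  # drop the finished page's children to save memory


for page_title, page_text in iter_pages(io.BytesIO(SAMPLE)):
    print(page_title, len(page_text))
```

For a real, compressed dump you could pass the file object returned by bz2.open(path, "rb") to iter_pages instead of the in-memory sample.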

GeoNames - Country Dumps

GeoNames.org has a collection of country-related data. The usual: Latitude, Longitude, Elevation, Population, Timezone, Country Code, Capital, Area (in sq. km), Top Level Domain, etc. Start loading this into your Madlib arsenal if you're looking to target the world :] http://download.geonames.org/export/dump/
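To give an idea of how this kind of data ends up in a Madlib sentence, here's a minimal sketch that reads tab-separated country rows and drops them into a template. It assumes the file is tab-delimited with '#' comment lines. The column list, the sample rows (illustrative values only), the template, and the function names are all mine, so compare them against the header of the file you actually download.

```python
import csv
import io

# Assumed column order for the sample below. Check the header comment in the
# real file before relying on it.
FIELDS = ["iso", "country", "capital", "area_sq_km", "population", "tld"]

# Illustrative sample rows, not copied from the real dump.
SAMPLE = (
    "# iso\tcountry\tcapital\tarea\tpopulation\ttld\n"
    "FR\tFrance\tParis\t547030\t64768389\t.fr\n"
    "JP\tJapan\tTokyo\t377835\t127288000\t.jp\n"
)

TEMPLATE = "{capital} is the capital of {country}, home to {population:,} people."


def read_rows(handle, fields=FIELDS):
    """Yield one dict per data line, skipping blank lines and '#' comments."""
    lines = (ln for ln in handle if ln.strip() and not ln.startswith("#"))
    for values in csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE):
        row = dict(zip(fields, values))
        row["population"] = int(row["population"])  # needed for the ',' format
        yield row


def fill(template, row):
    """Fill the Madlib-style template with one row's values."""
    return template.format(**row)


for country_row in read_rows(io.StringIO(SAMPLE)):
    print(fill(TEMPLATE, country_row))
```

Swapping io.StringIO(SAMPLE) for open("your_file.txt", encoding="utf-8") should be all it takes to run this on a downloaded file, provided FIELDS matches its columns.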

The Obvious: Google

The advanced search operator filetype: is very handy for our situation. The query mysql dump filetype:sql produced over 9,000 results at the time of writing. Get creative with this query and there's no telling what you'll come across. If you're interested in csv data, try filetype:csv instead. Prefix the operator with anything that comes to mind and Google will usually provide a generous heap of information. You'll have to filter through some of the results to find the good stuff, but the effort will pay off in the long run.
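If you want to try a batch of keyword and filetype combinations, a few lines of Python can generate the queries for you to paste into a browser. This is only a sketch: the function names are hypothetical, and the base URL is my assumption about the search address, not something taken from Blue Hat SEO's post.

```python
from urllib.parse import urlencode


def filetype_query(keywords, filetype):
    """Build a query string such as 'mysql dump filetype:sql'."""
    return f"{keywords} filetype:{filetype}".strip()


def search_url(query, base="https://www.google.com/search"):
    """Return a URL with the query encoded in the 'q' parameter.

    The default base address is an assumption; change it if needed.
    """
    return base + "?" + urlencode({"q": query})


for words, ext in [("mysql dump", "sql"), ("zip codes", "csv")]:
    print(search_url(filetype_query(words, ext)))
```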

Oddity Software

In addition to their non-free databases, Oddity Software's generous collection of free data sets is another must-have for any madlibber. If you haven't already, visit http://www.odditysoftware.com/free_lists.html to acquire an excellent base to start from, or to make a strong addition to your archive :D

myDataMaster.com

The generous collection at http://mydatamaster.com/free-downloads/ provides the following types of data dumps in sql/zip (available via http):

  • US Street Suffixes and Abbreviations
  • US State and Territory Abbreviations
  • US Embassies
  • Famous Birthdays
  • Mixed Quotes
  • World Cities and Languages
  • Articles on various topics
  • Medicare Coverage Details
  • US Nursing Homes
  • US Medical Suppliers
  • Home Business Articles
  • Database of Random Facts
  • Education/College Articles
  • Complete Bible Website

The next few posts will provide some examples (with code) for building a fully functioning Madlib site with our new library. I'm not a professional coder, so don't expect anything glamorous.

So, where do you get your data?


John @ 2009-12-02 04:17:52

Thanks for these, hopefully going to be put to good use once I brush up on my PHP

 
