Win a copy of Mesos in Action this week in the Cloud/Virtualizaton forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Best DBMS for a scenario

 
David Weber
Greenhorn
Posts: 9
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hey all,

I need your advice on the best DBMS and Setup (High-Scalability, High-Performance, High-Reliability) for the following scenario.

Current structure:
Currently I am using MySQL on one server, with INNODB.
But I plan to setup a cluster.

Tables:
Person Raw Data (blobs, this data is just archived and will possibly never be used)
Person (This object will be searched and looped for most application operations)
Person Address
Country

Database use:
The database will store about 3 billion people and their details.
My application (distributed-system) will need to verify people, and identify people.
(Unfortunately identification will require looping through every person until a match is found due to the nature of the identification, a simple "select * from `blah` where `blah` = 'abc' " wont work).

The requirements for my database:

  • Store Up to 3 billion people.
  • Retrieve all objects quickly. For identification loop.
  • Store about 15 blob objects with sizes 20-100kb per person
  • Need some way to join data, but mainly will only be using the person structure.
  • Identification and verification will need to be as quick as possible


  • Im thinking maybe MongoDB will be better suited to the task.

    Regards,

    David
     
    Peter Johnson
    author
    Bartender
    Posts: 5852
    7
    Android Eclipse IDE Ubuntu
    • Mark post as helpful
    • send pies
    • Quote
    • Report post to moderator
    Perhaps one of the nosql database, such as HBase (from Hadoop), Cassandra or CouchDB would meet your needs. See http://nosql-database.org/ for lots of nosql database options.
     
    • Post Reply
    • Bookmark Topic Watch Topic
    • New Topic