How to handle over 300K records with MySQL

10 replies
One of our SEO clients is having trouble handling a table with 150K records. They have 150K product items in their products table, and worse, they often use LEFT or INNER JOINs on it.

Some say it's better to use Oracle rather than MySQL to handle more than 100K rows in a table.
Companies like eBay seem to use MySQL as well; are they doing some special tuning?

Our engineer thinks we either need to split the database or simply upgrade the hardware.

What do you think? Any suggestions?
#300k #handle #mysql #records
  • metysj
    I guess more info is needed about what exactly they do and where the bottleneck is. MySQL is not as bad as many people say. Use pagination + AJAX, prepared statements, and/or optimize your queries and you should be fine.
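    For example (the table and column names here are made up), instead of pulling all 150K products at once, fetch one page at a time:

        -- LIMIT/OFFSET paging: fine for shallow pages.
        SELECT id, name, price
        FROM products
        ORDER BY id
        LIMIT 50 OFFSET 100;

        -- Keyset paging for deep pages: remember the last id from the
        -- previous page instead of making MySQL skip over the offset rows.
        SELECT id, name, price
        FROM products
        WHERE id > 150
        ORDER BY id
        LIMIT 50;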
  • eminc
    Originally Posted by RankSale:

    One of our SEO clients is having trouble handling a table with 150K records. They have 150K product items in their products table, and worse, they often use LEFT or INNER JOINs on it.
    Seems like you need some query optimization as well as MySQL database
    tuning. The problem could be in two places: sometimes we write queries
    that take a long time, or there are design flaws that keep the database
    from handling a large amount of data well. Better to consult a database
    administrator for that :-)

    And Google is always your friend. If you are facing problems in specific
    situations, you can always search for a tuning option.

    One article that explains optimization of subqueries and joins is here:
    How to optimize subqueries and joins in MySQL at Xaprb
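    As a quick illustration (the table names are made up), a correlated subquery
    that runs once per row can often be rewritten as a join:

        -- Correlated subquery: the inner SELECT may run for every product row.
        SELECT p.id, p.name,
               (SELECT MAX(o.created_at)
                FROM orders o
                WHERE o.product_id = p.id) AS last_ordered
        FROM products p;

        -- Equivalent LEFT JOIN + GROUP BY, which the optimizer usually handles better.
        SELECT p.id, p.name, MAX(o.created_at) AS last_ordered
        FROM products p
        LEFT JOIN orders o ON o.product_id = p.id
        GROUP BY p.id, p.name;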


    Originally Posted by RankSale:

    Our engineer thinks we either need to split the database or simply upgrade the hardware.
    I think you should try optimizing your existing instance before going
    for a hardware upgrade. If the problem lies in the query, sooner or
    later you will face it again.

    -Mohit
  • jminkler
    There's absolutely no reason MySQL couldn't handle 300K rows .. this is very small. For the query you are having a problem with, run "EXPLAIN SELECT ... FROM ... WHERE ..."

    and post the output here or in a PM ..
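    For example, with a made-up query it looks like this:

        EXPLAIN
        SELECT p.name, c.name
        FROM products p
        INNER JOIN categories c ON c.id = p.category_id
        WHERE p.price > 100;

        -- In the output, type = ALL with key = NULL on a big table means a
        -- full table scan, which usually points to a missing index on the
        -- joined or filtered column.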
  • infinitewp
    MySQL can handle way more than a million records. The major issues are usually:

    1) A server bottleneck (too many MySQL connections, or some other software eating 90% of the resources).
    2) The DB not being properly indexed (primary keys, foreign keys, and the columns you join and filter on) - see the sketch below.
    3) Unoptimized queries that fetch the same set of data over and over; cache those results where you can.
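    A minimal sketch of what point 2 can look like in practice (the table and column names are only examples):

        -- See what indexes already exist.
        SHOW INDEX FROM products;

        -- Add an index on the column used in joins and WHERE filters.
        ALTER TABLE products
          ADD INDEX idx_products_category_id (category_id);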

    Thanks
    • jasong714
      I recently faced a similar challenge hosting a MySQL server on an EC2 instance that was growing too quickly for me to keep up with.

      I came across these guys:

      Cloud Database

      Check them out and scale beyond your imagination. They have both AWS and Rackspace Cloud instances. Lightning-fast I/O and solid as a week-old donut.
  • Maraun
    I don't think a cloud database is the right answer to a simple optimization problem. What they need to do first is profile the app to find the bottlenecks, then remove them starting with the worst. That may just be a matter of adding an index to a certain column or changing the way queries are written; caching will help a lot too if possible, or it may even take a database redesign to better fit the workload. For a more specific answer, hire someone to look at it or post more details.
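    One easy way to start profiling (the settings shown are just an example) is MySQL's slow query log:

        -- Log every statement that takes longer than 1 second.
        SET GLOBAL slow_query_log = 'ON';
        SET GLOBAL long_query_time = 1;
        SET GLOBAL slow_query_log_file = '/var/log/mysql/slow.log';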
  • RankSale
    One programmer suggests not using INNER JOIN or LEFT JOIN.
    Instead, do the selection in two steps:
    first select the group of ids from the first table, then run a second SELECT with IN (id group), and do it in a loop.
    He claims that avoids the cost of joining table A against table B (like 5K records x 10K records).

    What do you think? Will that reduce the load compared to an INNER JOIN?
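    For clarity, this is roughly what the two approaches look like (the table names are just examples):

        -- Single join, done inside the database.
        SELECT p.name, o.quantity
        FROM products p
        INNER JOIN order_items o ON o.product_id = p.id
        WHERE p.category_id = 7;

        -- Two-step approach the programmer describes, driven from application code.
        SELECT id FROM products WHERE category_id = 7;   -- step 1: collect the ids
        SELECT product_id, quantity
        FROM order_items
        WHERE product_id IN (101, 102, 103);             -- step 2: ids from step 1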
    • jminkler
      Originally Posted by RankSale:

      One programmer suggests not using INNER JOIN or LEFT JOIN.
      Instead, do the selection in two steps:
      first select the group of ids from the first table, then run a second SELECT with IN (id group), and do it in a loop.
      He claims that avoids the cost of joining table A against table B (like 5K records x 10K records).

      What do you think? Will that reduce the load compared to an INNER JOIN?
      Couldn't say without seeing the EXPLAIN plan.
    • wayfarer
      Originally Posted by RankSale:

      One programmer suggests not using INNER JOIN or LEFT JOIN.
      Instead, do the selection in two steps:
      first select the group of ids from the first table, then run a second SELECT with IN (id group), and do it in a loop.
      He claims that avoids the cost of joining table A against table B (like 5K records x 10K records).

      What do you think? Will that reduce the load compared to an INNER JOIN?
      Almost surely no. A general rule of thumb is to always let the database do the heavy lifting if you can.

      As pointed out earlier, using indexes is one of the most straightforward ways of optimizing performance, and it will work as long as the fields being joined are not too long. If you're joining on a VARCHAR field, there is a maximum index key length depending on which engine you're using. For InnoDB it's 767 bytes (roughly 255 utf8 characters), so make sure your VARCHAR fields allow no more characters than you actually need.
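      A small sketch of indexing a VARCHAR join column (the names are hypothetical); a prefix index keeps the key within the engine's limit:

          -- Index only the first 100 characters of a long VARCHAR column
          -- so the key stays within InnoDB's index-length limit.
          ALTER TABLE products
            ADD INDEX idx_products_sku (sku(100));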

      Indexes make the save (insert/update) time a little longer, but unless you have a very write-heavy application, this is not a major concern.

      MySQL :: MySQL 5.5 Reference Manual :: 8.3 Optimization and Indexes
  • seasoned
    I have a couple of tables with over a million records, and they work fine. You can't simply define records, join them any which way, and expect them to perform. If you do it in the worst way, a database will actually run SLOWER than flat files! Flat files are the fastest way to get at the information; NOTHING can be faster, because every database uses them as a base.

    Databases are popular ONLY because they:
    1. simplify changes.
    2. provide an easy standard interface.
    3. handle locking.
    4. have a lot of labor-intensive routines that are slow BUT, IF USED RIGHT, can make access to info appear much faster than expected!
    5. have a few other features, such as network access.

    I could say more, but it would take several pages to cover the basics.

    Steve
