You are here: Re: [PHP] How to find random records in a subset? « PHP « IT news, forums, messages
Re: [PHP] How to find random records in a subset?

Posted by "Richard Lynch" on 06/03/05 22:18

On Thu, June 2, 2005 4:31 pm, Brian Dunning said:
> I am using a routine to find 50 random records in a large MySQL
> database (about a million records) where I generate a list of 50
> random unique ID's, and then use MySQL's "in" command to find them. I
> can't use "order by rand()" due to its performance hit.
>
> But I have to take it one more step: I want to first limit my found
> set to those matching a different search criteria, and then find 50
> of those.

If your search criteria is on an indexed field, you may find that the
order by rand() is much much much less of a problem once you winnow your
record set down to the search criteria.

Or, more precisely, if your search parameters get you a small result set,
the order by rand() will be "fast" because that is done AFTER the smaller
result set is determined.

If your search parameters get you a HUGE set, order by rand() not so fast.

> Anyone? Can this be done all within MySQL, or is it going to require
> some humongo PHP arrays?

How do you generate a list of 50 random unique IDs in the first place?

How to you guarantee that all those IDs are in the database?

What if a record gets deleted?

I suppose *ONE* option to try would be to just get the full result set,
and then use PHP's mt_rand() to skip forward/backward a random number of
records, using something like:

//untested code
$count = mysql_num_rows($result);
$ids = array();
while (count($ids) < 50 && count($ids) < $count){
$random = mt_rand(0, $count - 1);
mysql_data_seek($result, $random);
$id = $result['id'];
//Note use of $id as key, so duplicates are ignored.
$ids[$id] = $id;
}

This will perform MISERABLY when your number of records is closer to 50.

It would be okay only if all searches result in tons and tons of records.

One does have to wonder:
If the user has put in multiple search criteria, why order randomly? Why
not put best matches first?

--
Like Music?
http://l-i-e.com/artists.htm

 

Navigation:

[Reply to this message]


Удаленная работа для программистов  •  Как заработать на Google AdSense  •  England, UK  •  статьи на английском  •  PHP MySQL CMS Apache Oscommerce  •  Online Business Knowledge Base  •  DVD MP3 AVI MP4 players codecs conversion help
Home  •  Search  •  Site Map  •  Set as Homepage  •  Add to Favourites

Copyright © 2005-2006 Powered by Custom PHP Programming

Сайт изготовлен в Студии Валентина Петручека
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация