Re: Simple question on string extraction — PHP Programming Language

You are here: Re: Simple question on string extraction « PHP Programming Language « IT news, forums, messages

Posted by WombatBoy on 06/10/73 11:25

I've been doing something similar myself, but wanted to avoid the chance of
getting an accidental early string match.

The strpos() function will let you locate a string within another string
(I'm assuming here that you've got the whole html page as a single string),
and, if required, you can specify a starting position.

So something like

$p1 = strpos($rec,"</header>");

would let you get beyond the html header, then

$p2 = strpos($rec," by ",$p1);

would let you find the first occurrence of " by " beyond position $p1 (or
maybe "by<", depending whether there's a space there or not)

then you can search for and in the same way, adjust your sums a
bit, and get

$author = substr($rec,$start,$length);

where $start will probably be something like $p1+3 and $length something
like $p2-$p1-2, or whatever it turns out to be, and whichever way round $p1
and $p2 end up.

Hope this helps. As an alternative you might try the explode function using
" by " as the string to split $rec on, and then check each array element.

"Epetruk" <nobody@blackhole.com> wrote in message
news:3njvqpF1sm7fU1@individual.net...
> Hi,
>
> I'm having to modify a PHP script even though I have little knowledge of
> PHP
> itself. The script extracts specific strings from an html file, and I need
> to it extract some further information.
>
> Specifically, each file represents an article written by an author. The
> author's name is typically preceded by a 'By' or a 'by', then it goes on
> till there's a carriage return.
>
> So for example, the file might contain something like this:
>
>
> The Need For Regeneration
>
> by John Smith
>
> We have seen the waste that has been produced....
>
> (rest of article)
>
>
> or
>
>
> How To Make Lots and Lots of Money Writing PHP
>
> by The Supreme Coder
>
> The first thing you need to know about making money is...
>
> (rest of article)
>
>
> So I need code that will start searching the file from the beginning for
> the
> words 'by ' or 'By ', then grab everything that follows that until it gets
> to a new line and assign that to a variable. In the examples I have given
> above, it would grab 'John Smith' and 'The Supreme Coder'. I've
> seen
> a function called preg_match which might do the job, but it uses regular
> expressions which I have little knowledge of.
>
> Would any person be so kind as to post what arguments I would need to call
> this function with?
>
> TIA,
>
> --
> Akin
>
> aknak at aksoto dot idps dot co dot uk
>
>

Navigation:

Next in forum: Re: Australians Only?
Prev in forum: session change from 4.2.3 -> 4.3.x?
Thread view: Re: Simple question on string extraction

[Reply to this message]

Удаленная работа для программистов • Как заработать на Google AdSense • England, UK • статьи на английском • PHP MySQL CMS Apache Oscommerce • Online Business Knowledge Base • DVD MP3 AVI MP4 players codecs conversion help

Home • Search • Site Map • Set as Homepage • Add to Favourites

Сайт изготовлен в Студии Валентина Петручека —
изготовление и поддержка веб-сайтов, разработка программного обеспечения, поисковая оптимизация