Re: not sure who to ask... sorting data from a webpage...

Posted by mbstevens on 10/17/98 11:22

Eric wrote:
> Hi there, I'm wondering if anyone might know how I can sort through
> data from a web site.
>
> Here's what I mean: I go to a page like this,
> http://biz.yahoo.com/research/earncal/20050727.html
>
> and make lists in a text file that look like this,
> """"""""
> July 27/05
> am:
> zbra ycc xel wec wlp wlm vcg vitx uco umc tup trps twti tmo mos faf ba
> tin tds tem sup su seo fon see std res rcl rol rok resp quot pub px

> I do this by hand. As you can see there are 3 main categories,
> 1)before market open, 2) time not supplied and, 3) after market close
> and some specific times of earnings release.
>
> Can anyone tell me how to create these lists without typing them all
> out by hand?
>
> thanks for any help
> Eric

It could be completely automated all the way from the web page to a
formatted file on your local machine.

You could use Perl's LWP::Simple module to get the webpage and put it
into a variable.
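
Here is a rough sketch of that first step, written in Python rather than Perl, using only the standard library. The function name fetch_page and the 30-second timeout are made up for illustration, and the fallback to UTF-8 when the server names no charset is an assumption.

```python
from urllib.request import urlopen


def fetch_page(url, timeout=30):
    """Download the page at `url` and return its body as one string."""
    with urlopen(url, timeout=timeout) as response:
        # Use the charset the server declares; assume UTF-8 if it gives none.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# Usage (needs network access, so it is left commented out):
# html = fetch_page("http://biz.yahoo.com/research/earncal/20050727.html")
```

After the call, the whole page sits in a single string variable, ready for the parsing step.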

Next you could use Perl's HTML::Parser module to extract the plain text
you want from the HTML. You would likely also need the split function
and regular expressions as supplements to this.
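
A minimal sketch of the extraction step, again in Python, using the standard html.parser module in place of HTML::Parser. The sample HTML, the company names, and the rule that a ticker is a table cell of one to five capital letters are all assumptions for illustration. The real page's markup would have to be inspected before this works on it.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect the visible text chunks of a page, skipping script/style."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and not self._skip_depth:
            self.chunks.append(text)


def extract_text(html):
    """Return the non-empty text chunks found in `html`, in page order."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.chunks


# Assumed pattern: a ticker symbol is a chunk of 1 to 5 capital letters.
TICKER = re.compile(r"^[A-Z]{1,5}$")


def find_symbols(chunks):
    """Keep only the chunks that look like tickers, lowercased."""
    return [chunk.lower() for chunk in chunks if TICKER.match(chunk)]


# Made-up sample markup, not the real page.
SAMPLE = (
    "<table>"
    "<tr><td>Example Corp</td><td>ZBRA</td><td>Before Market Open</td></tr>"
    "<tr><td>Sample Inc</td><td>BA</td><td>Time Not Supplied</td></tr>"
    "</table>"
)

symbols = find_symbols(extract_text(SAMPLE))
```

The regular expression does the job the post assigns to split and regexes: it separates the ticker cells from the company names and time labels around them.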

Perl has sophisticated sorting facilities once you get the information
you want sucked into an array. The array could then be written in
whatever format you want to a file.
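
The sorting and writing step might look something like this in Python. It assumes the earlier steps produced (symbol, category) pairs. The "am:" label comes from Eric's example, while "not supplied:" and "pm:" are guessed labels for his other two categories. The function names and the 70-column wrap are hypothetical as well.

```python
import textwrap
from collections import defaultdict

# Category order for the output; only "am" appears in the original example.
CATEGORIES = ("am", "not supplied", "pm")


def format_lists(date_line, pairs, width=70):
    """Group symbols by category, sort each group, and build the text."""
    groups = defaultdict(list)
    for symbol, category in pairs:
        groups[category].append(symbol)

    lines = [date_line]
    for category in CATEGORIES:
        if groups[category]:
            lines.append(category + ":")
            # Sort alphabetically, then wrap the list to the line width.
            joined = " ".join(sorted(groups[category]))
            lines.extend(textwrap.wrap(joined, width=width))
    return "\n".join(lines) + "\n"


def write_lists(path, text):
    """Write the formatted lists to a local text file."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(text)


report = format_lists(
    "July 27/05",
    [("zbra", "am"), ("ba", "am"), ("xel", "pm")],
)
# write_lists("earnings.txt", report)  # file name is only an example
```

Pass reverse=True to sorted() if the lists should run from z to a instead.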

There is a lot of Perl documentation online, and you can get ActivePerl
for Windows at activestate.com. If you haven't programmed Perl before,
there will be a learning period, but it will automate your task
completely. Similar facilities exist for Python, a language Google has
used heavily.
--
mbstevens
http://www.mbstevens.com/

 
