msmemory_archive (
msmemory_archive) wrote2008-02-29 03:40 pm
![[personal profile]](https://www.dreamwidth.org/img/silk/identity/user.png)
HTML text to Office table
O mighty intarweb: Is there a quick way to take data that are in html, laid out visually like a table but actually simple text, and import it into a Word or Excel table, with the data properly separated into table columns? I know several brute force ways, but am I missing a friendly little tool that'll do it quick?
ETA: I'm trying to take this data (http://op.eastkingdom.org/Awards/Pelican.html and Laurel.html and Chivalry.html) and put it into a single grand date-sortable table. I figure this is a nice little project for a snowy Saturday morning.
ETA: I'm trying to take this data (http://op.eastkingdom.org/Awards/Pelican.html and Laurel.html and Chivalry.html) and put it into a single grand date-sortable table. I figure this is a nice little project for a snowy Saturday morning.
well...
I just click drag to sleect the table cells
and then paste it right into Word.
It works a treat for me.
Re: well...
Its just text, right?
Spaces format the text?
I copy the text to a notpad.
then I perform the following replacements:
replace all (space)(space) with (tab)
replace all (tab)(tab) with (tab)
replace all (tab)(space) with (tab)
(ctrl-h brings up the edit-replace dialog in Note pad)
then copy paste it into EXCel
It works in most cases.
You could probably write a word macro that does this but why bother unless you have to do this all the time?
Re: well...
All you have to do is copy it to Notepad, replace all double spaces with a tab, copy it and paste it right into an EXcel table.
I just did that, and it worked great.
no subject
no subject
no subject
no subject
Save the file. Open it in Word (you need to look at "All Documents," not jus "ALl readable docs") and Save as.. a text file. CLose the file, and open it in Excel. Tell the importer that it's a fixed width file, and it'll work.
This worked 'cause the first field, date, was always the same width. It's a lot more of a pain if that's not true.
no subject
-- having three undefined but named macros (m0, m1, m2) already assigned as keystroke commands and available for on-the-fly redefinition
-- using the pipe '|' as field delimited, since it's hardly ever used in the kind of datafiles you're talking about. Of course you have to search and test first.
Then Excel has a command to read in text files. Make sure to select *all* the fields and define them as Text (not "General") at the appropriate step, or various kinds of data will get effed up.
no subject
-- G
no subject
no subject