Python: looping through each row of an HTML table based on headers

- June 15, 2011

I'm new to Python (3.4) and have parsed an HTML table into headings[], rows[], and cells[]. I wish to store each of these in a (MySQL) table, with the field names being the items in headings[].

There are 4 headings ("data0", "data1", "data2", "data3") and there are 6 rows.

The code I have is rudimentary, using BeautifulSoup:

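    from bs4 import BeautifulSoup

    # r is assumed to be an HTTP response fetched earlier (it is not shown in the post)
    soup = BeautifulSoup(r.text, "html.parser")
    table = soup.find("table")
    cells = []
    rows = table.find_all("tr")
    headings = [th.get_text().strip() for th in table.find_all("th")]
    for row in rows:
        # collect the text of every data cell, row by row, into one flat list
        for cell in row.find_all("td"):
            cells.append(cell.get_text().strip())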

I'm used to doing case statements or, heaven forbid, a number of if statements. I would place them under the inner loop over each row's td cells. I would be working with a counter and doing something like:

    for row = 0 to len(rows)
        for cell = 0 to len(headings)
            select case cell
                case 0
                    (save cell contents to the field called headings[0])
                case 1
                    (save cell contents to the field called headings[1])
                ...
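For comparison, the counter-based idea above could be written in Python roughly as follows. This is only a sketch: the sample values are made up, and table_rows is a hypothetical name for a list holding one list of cell strings per row, rather than the flat cells list. enumerate supplies the counter, and indexing headings with it stands in for the whole select/case block.

```python
# Hypothetical sample data standing in for the parsed table.
headings = ["data0", "data1", "data2", "data3"]
table_rows = [
    ["a0", "a1", "a2", "a3"],
    ["b0", "b1", "b2", "b3"],
]

saved = []  # collects (row number, field name, value) triples
for row_index, row in enumerate(table_rows):
    for cell_index, value in enumerate(row):
        # The counter selects the field name directly, so no case branches are needed.
        field = headings[cell_index]
        saved.append((row_index, field, value))

print(saved[0])
```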

I'm not worried about the saving part (yet), but I can't wrap my head around not being able to use counters. I realize this is way beginner, but I would appreciate any insight (and brains).
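One way to do this without counters at all, sketched here with made-up data, is to pair each cell with its heading using zip and build one dictionary per row. The names table_rows and records are illustrative and do not appear in the original code. With BeautifulSoup, each inner list would presumably come from the td cells of a single tr.

```python
# Hypothetical sample data standing in for the parsed table.
headings = ["data0", "data1", "data2", "data3"]
table_rows = [
    ["a0", "a1", "a2", "a3"],
    ["b0", "b1", "b2", "b3"],
]

# zip pairs the first heading with the first cell, the second with the second, and so on.
records = [dict(zip(headings, row)) for row in table_rows]

for record in records:
    # Each value is looked up by field name instead of by position.
    print(record["data0"], record["data3"])
```

Each dictionary maps a field name to that row's value, which may be a convenient shape to hand to whatever does the saving later.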
