[Tutor] BeautifulSoup - getting cells without new line characters

Kent Johnson kent37 at tds.net
Fri Mar 31 18:12:13 CEST 2006


jonasmg at softhome.net wrote:
> Kent Johnson writes: 
> 
> 
>>jonasmg at softhome.net wrote:
>>
>>> From a table, I want to get the cells for then only choose some of them.  
>>>
>>><table>
>>><tr>
>>><td>WY</td>
>>><td>Wyo.</td>
>>></tr>
>>>...
>>></table>  
>>>
>>>Using:  
>>>
>>>for row in table('tr'): print row.contents  
>>>
>>>   ['\n', <td>WY</td>, '\n', <td>Wyo.</td>, '\n']
>>>   [...]  
>>>
>>>I get a new line character between each cell.  
>>>
>>>Is possible get them without those '\n'? 
>>
>>Well, the newlines are in your data, so you need to strip them or ignore 
>>them somewhere. 
> 
> I want only (for each row) to get some positions (i.e. 
> row.contents[0],row.contents[2]) 

It sounds like you should just work with row('td') instead of 
row.contents. That will give you a list of just the <td> elements.

Kent



More information about the Tutor mailing list