Parse bad xml file

David Jobes david.jobes.sony at gmail.com
Fri Oct 10 13:46:14 CEST 2014


I was given a badly or poor formatted xml file that i need to convert to csv file:

<?xml version="1.0"?>
<resultset xmlns:dyn="http://exslt.org/dynamic">
<table name="SIGNATURE">
	<column name="ID" type="String">	</column>
	<column name="NUM" type="Integer">	</column>
	<column name="SEVERITY_ID" type="Integer">	</column>
	<column name="NAME" type="String">	</column>
	<column name="CLASS" type="String">	</column>
	<column name="PRODUCT_CATEGORY_ID" type="Integer">	</column>
	<column name="PROTOCOL" type="String">	</column>
	<column name="TAXONOMY" type="String">	</column>
	<column name="CVE_ID" type="String">	</column>
	<column name="BUGTRAQ_ID" type="String">	</column>
	<column name="DESCRIPTION" type="String">	</column>
	<column name="MESSAGE" type="String">	</column>
	<column name="FILTERTYPE" type="String">	</column>
	<data>
		<r>
			<c>00000001-0001-0001-0001-000000000027</c>
			<c>27</c>
			<c>2</c>
			<c>0027: IP Options: Record Route (RR)</c>
			<c>Network_equip</c>
			<c>10</c>
			<c>ip</c>
			<c>100741885</c>
			<c>2001-0752,1999-1339,1999-0986</c>
			<c>870</c>
			<c></c>
			<c></c>
			<c></c>
		</r>


I have been able to load and read the file line by line, but once i get to the r line and try to process each c(column) that is where it blows up. I need to be able to split the lines and place each one or the r (row) on a single line for the csv.

i have a list set for each one of the headers based on the col name field, i just have been able to format properly. 



More information about the Python-list mailing list