If I had a data file with HTML/XHTML tags:
Code:
<html>
<head>
<title> data file </title>
</head>
<body>
<center><h1>
heading 1
</h1></center>
<b>bolded</b>
<P>paragraph</P>
<P>
<br />
How would I get a Python program to read ONLY the start and end tags and enqueue them in a queue?
For example, the queue for this file would look like:
<html>
<head>
<title>
</title>
</head>
<body>
<center>
<h1>
</h1>
</center>
<b>
</b>
<P>
</P>
<P>
<br />
Thanks!
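
One possible approach, as a minimal sketch rather than a definitive answer: subclass the standard library's `html.parser.HTMLParser` and append each tag to a `collections.deque` as the parser reports it. The names `TagQueueParser` and `tag_queue` are made up for illustration, and the sketch assumes attributes can be dropped, since the sample file has none.

```python
from collections import deque
from html.parser import HTMLParser


class TagQueueParser(HTMLParser):
    """Hypothetical helper: enqueues tags in document order and ignores text."""

    def __init__(self):
        super().__init__()
        self.tags = deque()

    def handle_starttag(self, tag, attrs):
        # Called for an opening tag such as <html>; attributes are dropped.
        self.tags.append(f"<{tag}>")

    def handle_endtag(self, tag):
        # Called for a closing tag such as </html>.
        self.tags.append(f"</{tag}>")

    def handle_startendtag(self, tag, attrs):
        # Called for an XHTML-style self-closing tag such as <br />.
        self.tags.append(f"<{tag} />")


def tag_queue(markup):
    """Return a deque holding only the tags found in the markup string."""
    parser = TagQueueParser()
    parser.feed(markup)
    parser.close()
    return parser.tags
```

To use it on a file, read the file into a string and pass it in, for example `tag_queue(open("data.html").read())`, where `data.html` is a placeholder filename. Items can then be dequeued from the front with `popleft()`. Note that `HTMLParser` lowercases tag names, so `<P>` would be enqueued as `<p>`.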