Here is part of the HTML:
<body>
<table>
<tr>
<td id="ctl00_ContentPlaceHolder1_TD_Unlimited_Usage_PerDay" class="form_label"
colspan="2">
<table width='100%' cellspacing='2' cellpadding='0' border='0'>
<tbody>
<tr>
<td align='center' bgcolor='#E1E1E1'>Date</td>
<td align='center' bgcolor='#E1E1E1'>Download Usage</td>
<td align='center' bgcolor='#E1E1E1'>Upload Usage</td>
<td align='center' bgcolor='#E1E1E1'>Total Usage</td>
</tr>
<tr>
<td class='table_item' align='center' bgcolor='#EBEBEB'>2012-11-02</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>1.269 GB</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>0.286 GB</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>1.554 GB</td>
</tr>
<tr>
<td class='table_item' align='center' bgcolor='#EBEBEB'>2012-11-03</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>2.602 GB</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>0.152 GB</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>2.754 GB</td>
</tr>
Now I need to create a hash with the date as the key and three values (download, upload, total).
This hash will be used for further processing.
Then I need to create a CSV file from the hash so that I can insert the data into a database (date, download, upload).
The problem I am facing is that the HTML extracted from the web page is not formatted in a logical way; by that I mean the lines in the file do not follow a "structured" form. Also, I do not want to use any CPAN module, because I feel more comfortable "driving" manual... I hate automatic! Just kidding.
So far I have tried several approaches, one of which was regexes, and I realized that I was creating problems rather than solving them. I could list some of these approaches, but maybe a proven approach from you guys and gals can point me toward a solution.
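To make the goal concrete, here is a minimal sketch of the pipeline described above: table rows to a hash keyed by date, then CSV lines with date, download, and upload. It is written in Python rather than Perl, purely for illustration, and uses only the standard library, so no third-party module is involved. The names `UsageTableParser`, `usage_by_date`, `to_csv`, and `SAMPLE_HTML` are hypothetical, and the sketch assumes every data row has exactly four cells with a `YYYY-MM-DD` date in the first one. Values are kept as the original strings (for example `1.269 GB`).

```python
import csv
import io
import re
from html.parser import HTMLParser

# Two rows copied from the excerpt above; the rest of the page is omitted.
SAMPLE_HTML = """
<table width='100%' cellspacing='2' cellpadding='0' border='0'>
<tbody>
<tr>
<td align='center' bgcolor='#E1E1E1'>Date</td>
<td align='center' bgcolor='#E1E1E1'>Download Usage</td>
<td align='center' bgcolor='#E1E1E1'>Upload Usage</td>
<td align='center' bgcolor='#E1E1E1'>Total Usage</td>
</tr>
<tr>
<td class='table_item' align='center' bgcolor='#EBEBEB'>2012-11-02</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>1.269 GB</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>0.286 GB</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>1.554 GB</td>
</tr>
<tr>
<td class='table_item' align='center' bgcolor='#EBEBEB'>2012-11-03</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>2.602 GB</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>0.152 GB</td>
<td class='table_item' align='center' bgcolor='#EBEBEB'>2.754 GB</td>
</tr>
"""

# Assumption: dates always look like 2012-11-02.
DATE_RE = re.compile(r"\d{4}-\d{2}-\d{2}")


class UsageTableParser(HTMLParser):
    """Collect the text of each <td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td":
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._cell is not None:
            if self._row is not None:
                self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def usage_by_date(html):
    """Return {date: {"download": ..., "upload": ..., "total": ...}}."""
    parser = UsageTableParser()
    parser.feed(html)
    parser.close()
    usage = {}
    for row in parser.rows:
        # Skip the header row and anything that is not a four-cell data row.
        if len(row) == 4 and DATE_RE.fullmatch(row[0]):
            date, download, upload, total = row
            usage[date] = {"download": download, "upload": upload, "total": total}
    return usage


def to_csv(usage):
    """Render date,download,upload lines, sorted by date."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for date in sorted(usage):
        writer.writerow([date, usage[date]["download"], usage[date]["upload"]])
    return out.getvalue()


print(to_csv(usage_by_date(SAMPLE_HTML)))
```

Because the parser reacts to tags rather than to line layout, it does not matter how the cells are split across lines in the file. The same idea (track "inside a row" and "inside a cell" while walking the tags) can be carried over to a hand-written Perl loop.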