Hi there,
I need to integrate several data sources into one database. The sources come in these formats:
- .txt
- .csv
- .html
- .xls
- Microsoft Access file
Which attributes should I look at to construct a global schema, given that each source has different attributes? They do have a few in common, such as name, DOB, Country, and CountryName.
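To make the question concrete, here is a minimal Python sketch of the kind of mapping I imagine, and it may well be the wrong approach. The global schema is built from the common attributes listed above. The per-source column names (`full_name`, `birth_date`, `country_code`) and the function `to_global` are made up for illustration and are not taken from my actual files.

```python
# Global (target) attributes: the ones the sources seem to share.
GLOBAL_SCHEMA = ("name", "dob", "country", "country_name")

# Hypothetical per-source mappings: source column -> global attribute.
# The source column names below are assumptions, for illustration only.
SOURCE_MAPPINGS = {
    "csv": {
        "Name": "name",
        "DOB": "dob",
        "Country": "country",
        "CountryName": "country_name",
    },
    "xls": {
        "full_name": "name",
        "birth_date": "dob",
        "country_code": "country",
        "country": "country_name",
    },
}


def to_global(record, source):
    """Rename a source record's keys to the global schema.

    Columns with no mapping are dropped; global attributes that the
    source does not provide are set to None.
    """
    mapping = SOURCE_MAPPINGS[source]
    renamed = {mapping[key]: value for key, value in record.items() if key in mapping}
    return {attr: renamed.get(attr) for attr in GLOBAL_SCHEMA}


if __name__ == "__main__":
    row = {"full_name": "Ann", "birth_date": "1990-01-01", "extra": 1}
    print(to_global(row, "xls"))
```

Each source would get its own mapping, and every record would be converted to the same set of global attributes before being loaded into the database.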
A few hints on how to approach this would be appreciated. Links to any websites that illustrate or explain the concepts well would be awesome.
Thank you.