Lab 9: Revision

For Lab 9, we will not have new material. You can work on the exercises that you didn’t have time to work on such as using Metanome to discover functional dependencies. You can also work on the remaining exercises from Lab 8.

Discovering functional dependencies using Metanome

FD Discovery

Download the Metanome profiler and a set of the functional dependency discovery algorithms, run one of the algorithms on csv file (you can find examples of datasets on the same website), and report the discovered FDs. Metanome is built using JAVAso you will need to install it on your computer.

Solution: You will need to follow the following steps:

  • If you don’t have java on your machine, you will need to install it. Check link for instructions and available versions.

  • Download Metanome from the link.

  • Extract the contents of the zip file and save the content in a folder of your choice.

  • Download the algorithms Tane, fdep and FastFDs and move them inside the folder (…../deployment-1.2-SNAPSHOT-package_with_tomcat/backend/WEB-INF/classes/algorithms/).

  • Open Windows Powershell or Mac/ Linux terminal and change the current working directory to the directory (…../deployment-1.2-SNAPSHOT-package_with_tomcat/) and type (./run.sh). You may need to use run.bat for Windows. Wait until the server starts correctly.

  • Open a web browser and type (http://localhost:8080/) in the address bar. You will see the user interface of the tool. You should be able to see the algorithms as in the figure below.

  • Try to run one of the algorithms on the available datasets. and record your findings.

  • If you want to use your own data, copy the csv files into (…../deployment-1.2-SNAPSHOT-package_with_tomcat/backend/WEB-INF/classes/inputData/).

Metanome Interface

Parts from Lab 8

Continue to work on the exercises from lab 8 such as data reduction using PCA and discretization. Answers for lab 8 will be posted tomorrow after the lab directly.