Running the RTClean tutorial

https://github.com/delgaudl/RTClean

Download

git clone https://github.com/delgaudl/RTClean
git clone https://github.com/HoloClean/holoclean

1. Create conda env

conda create -n RTC38 python=3.8
conda activate RTC38

2. Modify requirements.txt

gensim==3.8.3
numpy==1.19.5
pandas==1.1.5
psycopg2-binary==2.9.1
pyitlib==0.2.3
pytest-xdist==3.6.1
python-Levenshtein==0.12.2
scikit-learn==0.24.0
scipy==1.5.4
sqlalchemy==1.3.24
torch==1.7.1
tqdm==4.50.2

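If you are behind a proxy, you may need to set these variables first (adjust the address and port to match your proxy):

export https_proxy=http://127.0.0.1:7890
export http_proxy=http://127.0.0.1:7890

Then install the dependencies:

pip install -r requirements.txt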
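To confirm that pip installed the pinned versions, a small check like the one below may help. It is only a sketch: the function names parse_pins and find_mismatches are made up for this post, and it relies on importlib.metadata from the standard library (available since Python 3.8). Some builds report a local suffix (for example a torch wheel with a CUDA tag), which this strict comparison would flag as a mismatch.

```python
from importlib import metadata

# Pins copied from the requirements.txt shown above.
PINS = """
gensim==3.8.3
numpy==1.19.5
pandas==1.1.5
psycopg2-binary==2.9.1
pyitlib==0.2.3
pytest-xdist==3.6.1
python-Levenshtein==0.12.2
scikit-learn==0.24.0
scipy==1.5.4
sqlalchemy==1.3.24
torch==1.7.1
tqdm==4.50.2
"""


def parse_pins(text):
    """Turn 'name==version' lines into a {name: version} dict."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        name, _, version = line.partition("==")
        pins[name.strip()] = version.strip()
    return pins


def find_mismatches(pins):
    """Return {name: (expected, installed_or_None)} for each unsatisfied pin."""
    mismatches = {}
    for name, expected in pins.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    for name, (expected, installed) in find_mismatches(parse_pins(PINS)).items():
        print(f"{name}: expected {expected}, found {installed}")
```

If the script prints nothing, every pinned package is installed at the expected version.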

3. Modify Code

Running the code on Python 3.8 fails with this error:

    tic = time.clock()
AttributeError: module 'time' has no attribute 'clock'

time.clock() was removed in Python 3.8. Replace every call to time.clock() with time.time().
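Doing the replacement by hand is tedious if the call appears in many files. The sketch below automates it with a plain string replace over every .py file in a directory. The function name replace_time_clock and the "holoclean" path are assumptions for illustration; point it at whichever cloned directory raises the error, and review the result with git diff afterwards.

```python
from pathlib import Path


def replace_time_clock(root):
    """Rewrite time.clock() to time.time() in every .py file under root.

    Returns the list of files that were changed.
    """
    changed = []
    for path in Path(root).rglob("*.py"):
        source = path.read_text(encoding="utf-8")
        if "time.clock()" not in source:
            continue  # nothing to patch in this file
        patched = source.replace("time.clock()", "time.time()")
        path.write_text(patched, encoding="utf-8")
        changed.append(path)
    return changed


if __name__ == "__main__":
    # "holoclean" is an assumed path: the directory created by git clone.
    for path in replace_time_clock("holoclean"):
        print("patched", path)
```

The Python documentation suggests time.perf_counter() or time.process_time() as replacements for time.clock(); time.time() is used here only to match the fix described above.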

4. Test holoclean

Run the example script to check that HoloClean works:

examples/holoclean_repair_example.py

5. Install extra requirements

pip install rdflib pyfuseki
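A quick way to check that the extra packages are importable from the active environment is sketched below. The helper name missing_modules is made up for this post; it only looks the modules up with importlib.util.find_spec and does not import them.

```python
from importlib.util import find_spec


def missing_modules(names):
    """Return the module names from `names` that cannot be found for import."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    # Import names assumed to match the pip package names installed above.
    missing = missing_modules(["rdflib", "pyfuseki"])
    print("missing:", missing if missing else "none")
```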

Author: John Doe
Posted on: September 10, 2024
Source: https://www.hardyhu.cn/2024/09/10/Running-the-RTClean-tutorial/