Distributed Near Duplicate Detection for Big Data
- Java 8 or higher
- Maven 3.6+
- Docker & Docker Compose (optional, for databases)
To start the required MongoDB and MySQL databases:
docker-compose up -dmvn clean install
mvn exec:java -Dexec.mainClass="org.dudu.Main"If you use ∂u∂u in your research, please cite the below paper:
- Kathiravelu, P., Galhardas, H. and Veiga, L., 2015, October. ∂u∂u Multi-Tenanted Framework: Distributed Near Duplicate Detection for Big Data. In OTM Confederated International Conferences" On the Move to Meaningful Internet Systems" (pp. 237-256). Cham: Springer International Publishing.