Skip to content

Group project for Information Retrieval and Data Mining 2016 -- Project 8 Mining fine art paintings for creativity understanding

Notifications You must be signed in to change notification settings

helanto/irdm-2016

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

irdm-2016

This is the group project for Information Retrieval and Data Mining module. Group Project Option 8 - Mining fine-art paintings for creativity understanding


Data

All the data files are downloaded from http://www.wga.hu/index1.html

Team - Members

Name email
Ilias Antoniou [email protected]
James Hale [email protected]
Cyrus Parlin [email protected]

Code

We used python 2.7

Need to install the following modules:

leargist
PIL

Instructions for installing leargist can be found here: https://pypi.python.org/pypi/pyleargist

Procedure

We will briefly describe the procedure needed to be followed in order to run the code.

Procedure

  • We run save_images.py script in order to save images on disc. This is needed in order to extract classemes and picodes features later.
  • Default setting is to save images under /images directory. A listimages.txt file is generated as well.
  • We need to download vlg extractor from vlg. We need to download parameters as well (5GB)
  • For linux OS we need opencv2.3.0. This release is quite old so we need to download it from opencv2.3. After downloading it we need to compile it. Build FFMPEG option gives an error so we remove it as we do not need video analysis. We follow similar procedure as described here.
  • build configuration: cmake -D CMAKE_BUILD_TYPE=RELEASE -D WITH_FFMPEG=OFF -D CMAKE_INSTALL_PREFIX=/usr/local -D BUILD_ZLIB=ON -D BUILD_PYTHON_SUPPORT=ON ~/opencv/opencv-2.3.0
  • Now we are ready to extract classemes and picodes using vlg. We run the following command: ./vlg_extractor --extract_classemes=FLOAT --extract_picodes2048=FLOAT --parameters-dir=parameters/parameters_1.1 ~/PycharmProjects/irdm-2016/listimages.txt ~/PycharmProjects/irdm-2016 ~/PycharmProjects/irdm-2016/features. It saves features under /features/images directory if we stick with default configuration.
  • We run main.py script for analysis.

Literature Review

Notes:

Results

Most similar paintings:

  • Regatta at Sainte-Adresse
  • It seems like the same painting with slightly different colours appear twice in the dataset. Score: 0.8866

Most different paintings:

  • The Drunken Silenus
  • The Grosse Gehege near Dresden
  • The paintings are quite different with similarity score: 0.188156
  • Interestingly enough the most similar painting to Ruben's Silenus is the The Drunken Hercules of the same artist. The similarities between the two paintings are obvious:
  • The Drunken Hercules

PageRank analysis on the network:

After performing PageRank analysis on the network proposed by Elgammal and Saleh we end that the top-10 bost important (novel and influential) nodes of the network are the following:

About

Group project for Information Retrieval and Data Mining 2016 -- Project 8 Mining fine art paintings for creativity understanding

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages