Skip to content

Releases: MaartenGr/Concept

v0.2.1

05 Nov 10:33
d270607
Compare
Choose a tag to compare
  • Fixed issue when loading in more than 40.000 images
  • Fixed transform only working with pre-trained embeddings

v0.2.0

02 Nov 10:59
b3fe6b1
Compare
Choose a tag to compare

Added c-TF-IDF as an algorithm to extract textual representations from images.

from concept import ConceptModel

concept_model = ConceptModel(ctfidf=True)
concepts = concept_model.fit_transform(img_names, docs=docs)

From the textual and visual embeddings, we use cosine similarity to find the best matching words
for each image. Then, after clustering the images, we combine all words in a cluster into a single
documents. Finally, c-TF-IDF is used to find the best words for each concept cluster.

The benefit of this method is that it takes the entire cluster structure into account when creating the
representations. This is not the case when we only consider words close to the concept embedding.

v0.1.1

01 Nov 07:18
a390976
Compare
Choose a tag to compare
  • Fix RAM issues
  • Update documentation
  • Add ftfy dependency
  • Fix .visualize_concepts
  • Added .search_concepts

v0.1.0

27 Oct 14:35
05031fd
Compare
Choose a tag to compare