Skip to content

alemtgetu/resume_job_desc_matching

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

start

when thinking of resume and JD matching how can one approach to solve the problem if they dont have NLP/ML knowledge

One obvious way we can approach this problem by using naive word matching using pattern matching algorithm and finding each skills listed in the resume by searching the JD required skills for a matching pattern of a skill found in the resume. We will do this for each skill found in the resume.

For example for a JD with skills required "aws, python, leadership, machine learning" a perfect match would be a resume with skills "aws, python, leadership, machine learning"

But if the resume sills have the following "python, team leader, support vector machine, cloud computing" the algorithm fails to match the resume with the JD with high score even though all the skills in the resume "support vector machine" is type of "machine learning", "team leader" and "leadership" are same thing, and "cloud computing" includes "aws".

So how do we solve this problem

Lets understand Word2Vec

Understanding Doc2vec

Tutorials

dataset

ToDos

  1. baseline doc2vec model using cosine distance matching
  2. organize resumes_corpus in to one json.gz file with {label: "label fileName.lbl", text: "resume text fileName.txt"}
  3. cluster resumes and job descriptions by category
  4. improve resume matching using clustering and doc2vec model
  5. Organize datasets
    • resume dataset
    • job description
    • use the categories from the jd dataset to be applied to the

Findings

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published