Emerging Adulthood Measured at Multiple Institutions 2 (EAMMI2)

This project uses unsupervised learning techniques to explore a psychological survey of young adults. The survey has many sections, covering the attitudes, behaviors, and beliefs surrounding adulthood.

Self-worth / Confidence
1. I can solve most problems if I invest the necessary effort.
2. I can remain calm when facing difficulties because I can rely on my coping abilities.
3. I make independent decisions.
Mindfulness
1. It seems I am running on automatic, without much awareness of what I’m doing.
2. I break or spill things because of carelessness or not paying attention.
3. I tend not to notice feelings of physical tension until they really grab my attention.
Achievement
1. I am capable of supporting a family financially.
2. I am no longer living in parents' household.
3. I am settled into a long-term career.
Family
1. Marriage is an important aspect of adulthood.
2. Being capable of caring for children is an important aspect of adulthood.
3. Being capable of supporting parents financially is an important aspect of adulthood.
Support
1. I get the emotional help and support I need from my family.
2. There is a special person in my life who cares about my feelings.
3. I can count on my friends when things go wrong.
Self-control / Responsibility
1. I avoid becoming drunk.
2. I accept responsibility for my actions.
3. I use contraception if sexually active and not trying to conceive a child.
Neuroticism
1. My feelings are easily hurt when I feel that others do not accept me.
2. I feel that I am unable to control the important things in my life.
3. Is this period of your life a time of feeling stressed out?

Using Hierarchical Clustering on reduced feature set

Using Ward's linkage method to minimize within-cluster variance

Using Chi-squared test to test independence of the clusters with regard to the held out "Subjective Well-being" questions.

The young adults were binned by their cumulative SWB scores (low, neutral, high), and the distribution of each cluster was tested against that of every other cluster. The p-value, shown in the yellow triangle, is the probability of observing these (or more extreme) distributions given that the clusters are not independent.
Therefore, the lower p-values support the claim of independence.

Plot distributions of SWB and held-out demographics

As shown by the p-values regarding Subjective Well-being, Clusters 2 and 3 are similar, as are Clusters 4 and 5.

Note: The following are quick visual snapshots of some demographic distributions. No statistical tests have been run to confirm the differences.

Cluster 1 contains more of the Other gender.

Clusters 2, 3, and 4 show different age distributions.

Cluster 3 shows a different education distribution.

Conclusion

The features reduced to an interpretable set of topics.
The data clustered meaningfully around the held-out "Subjective Well-being" questions.
Upon first glance, it seems promising that the clusters contain different demographic distributions.

Next Steps

Dig into the differences of the clusters and confirm with statistical tests
Include open-ended text sections
- NLP / Sentiment Analysis
Include the "duration" feature, which measures the time spent answering each section
Use supervised learning to predict classification labels
- e.g. Subjective Well-being - low, neutral, high
Measure relationship between family/upbringing and belonging/support

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
data		data
images		images
main		main
misc_nbs		misc_nbs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
eammi_slides.key		eammi_slides.key
eammi_slides.pdf		eammi_slides.pdf
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Emerging Adulthood Measured at Multiple Institutions 2 (EAMMI2)

Table of Contents

Hypotheses

Technology

Data

Citation

In summary

Preprocessing

Execution

Using Non-negative Matrix Factorization for topic extraction.

Using Hierarchical Clustering on reduced feature set

Using Chi-squared test to test independence of the clusters with regard to the held out "Subjective Well-being" questions.

Plot distributions of SWB and held-out demographics

Conclusion

Next Steps

About

Releases

Packages

Languages

License

anthonybaulo/emerging-adulthood-psych-study

Folders and files

Latest commit

History

Repository files navigation

Emerging Adulthood Measured at Multiple Institutions 2 (EAMMI2)

Table of Contents

Hypotheses

Technology

Data

Citation

In summary

Preprocessing

Execution

Using Non-negative Matrix Factorization for topic extraction.

Using Hierarchical Clustering on reduced feature set

Using Chi-squared test to test independence of the clusters with regard to the held out "Subjective Well-being" questions.

Plot distributions of SWB and held-out demographics

Conclusion

Next Steps

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages