☕ NLP Engineer from Paris, France | パリ、フランスからの自然言語処理エンジニア
💾 & 🇯🇵 Master's degree, in Natural Language Processing - Multilingual engineering, Japanese at I.N.A.L.CO
(I.N.A.L.C.O 東洋言語文化学院、自然言語処理修士 | 卒業)
💵 Master's Degree in International Business (EMIB) at ESCP Europe
(ESCPヨーロッパ・ビジネススクール、経営管理修士 | 卒業)
🏁 I have managed AI and business initiatives for prestigious brands, publishing companies, technology companies, and financial institutions for over 10 years. Born and raised in Paris, I am fluent in French, English, Japanese (JLPT N2), and Spanish. I am passionate about quantitative finance, network effects, and natural language processing.
Please feel free to connect with me on GitHub, LinkedIn, Discord, or HuggingFace! 😃
Libérté, égalité, architecture open-sourcé... Open-source RAG LlamaIndex and multilingual LLM from Mistral AI in a low-resource environment over financial statements:
- Vanilla RAG (retrieval-augmented generation) with LlamaIndex and pgvecto.rs from TensorChord
- Quantized model of Mistal8x7B from Mistral AI and LlamaCPP inference tool
- Test of BGE-M3 embedding model from BAAI
- Plus benchmark in German on Porsche AG, in French on Hermès, and in Italian on Brunello Cucinelli
Generative AI for all - Quick implementation with an open-source RAG LlamaIndex and Japanese LLM from ELYZA, Inc. in a low-resource environment over legal documents:
- RAG (retrieval-augmented generation) is LlamaIndex with a vanilla Hybrid search (combining retrieval from both text search and vector search)
- Japanese LLM “ELYZA-japanese-Llama-2-7b-instruct” created by Japanese startup, ELYZA, Inc.
- Open-source database PostgreSQL transformed into a vector database by the great library PG Vector
- Plus Q&A analysis in Japanese, embedding pricing war, and generative AI strategy of France, USA, and Japan
NER-Luxury is a fine-tuned XLM-Roberta model for the subtask N.E.R (Named Entity Recognition) in English. NER-Luxury is domain-specific for the fashion and luxury industry with bespoke labels. NER-Luxury is trying to be a bridge between the aesthetic side and the quantitative side of the fashion and luxury industry.
- 38.063 sentences in English (covering the beauty, fashion, and luxury industries)
- 32 labels from companies, groups, and holdings to luxury brands, models, and magazines
- Loss: 0.3990, Accuracy: 0.9427, F1: 0.7879
To promote communication between NLP practitioners, I created a vocabulary deck of 420 technical words for Anki Pro (learning software based on space repetition) in Japanese, English, and French for natural language processing.
-
Test on earning calls of Fast Retailing Co., Ltd 2022 with Whisper OpenAI
-
LDA analysis on shareholder's letter
-
Sustainability as a strategy in fashion and in NLP
- Introduction from Pregroup Grammar, DisCoCat, to Lambeq
- Specificities of the Japanese language
- Pregroup Grammar in free word order
- Quick financial analysis
- Why is TikT0k a mistake in luxury?
- Black Scholes (closed form) and the Greeks implemented in Python and in C++
- Linguistic specificities for NLP in Japanese
- Leveraging spaCy framework, and Ginza model, and building a custom NER model
- Insights on Masayoshi Son (Softbank), Hiroshi Mikitani (Rakuten), and Haruhiko Kuroda (Bank of Japan)
- Few thoughts on NLP in Japanese
- Naive OLS Linear regression model
- Facebook's Prophet model
- Bayesian Linear regression model with STAN
- And a quick introduction to the Hierarchical model
- Natural Language Processing using NLTK and Vador
- Time Series for sentiment analysis
- "Meme" as a marketing or investment strategy?
- Insights on the human perceptions of leadership on Twitter
- Financial analysis of luxury groups in 2020 during a high volatility state with the spread of SARS-Cov2
- Found an increasing correlation on Y-o-Y between luxury groups and the Facebook Inc. ecosystem
- Conclusion for stakeholders and shareholders