Hello! I'm Danmeng CAI, a manager in the Global Division Department at Commerce Robotics, a dynamic start-up based in Chiba, Japan. My role involves overseeing the daily tasks of our team while also engaging in software development within the SaaS team, where I apply Large Language Models (LLMs) and other AI technologies to create innovative solutions. I hold an MA in International Area Studies, with a focus on North American regions, and an MS in Informatics, specializing in Natural Language Processing (NLP). My passion for nature conservation led me to complete a Certificate Program in Nature Conservation at the University of Tsukuba, Japan, where I explore the intersection of AI and environmental sustainability.
Based on previous literature on Multi-Target Feature Selection (MTFS) and psychology, this research proposes a MTFS method to mitigate overfitting when predicting psychological status using text data.
*Paper accepted by iiWAS 2024
Utilizing NLP techniques, we extracted nouns from Japanese tweets as language features. These features were then used in regression analyses to link with subjective well-being scores, followed by visualization through a new word cloud technique.
*Paper accepted by DEIM 2024
The IPBES is an intergovernmental body that has taken significant role in examining biodiversity trends. In this project, we leverage NLP techniques to undertake contextual semantic network examination of the expert comments from IPBES.
*Poster accepted by Tsukuba Conference 2023
In the world of AI and NLP, the quality of a prompt can make or break the outcome. Crafting an effective prompt is an art that involves careful iteration and understanding the nuances of language models. It also requires continuously refining your instructions. A well-crafted prompt can mean the difference between a vague response and a highly detailed, actionable output. In this blog post, I want to share my journey of iteratively improving a prompt to achieve better summarization results for meeting notes written in Japanese. --> Read More
Nested cross-validation is a powerful technique for evaluating the generalization performance of machine learning models, particularly useful when tuning hyperparameters. It involves two layers of cross-validation: an outer loop for assessing the model’s performance and an inner loop for hyperparameter tuning. --> Read More
Hello, data enthusiasts and social media buffs! Ever wondered if our personality types influence the emojis we use in tweets? Let’s embark on a delightful data journey to explore this connection. We’ll be using Python, some cool libraries, and a sprinkle of emoji magic to visualize how different MBTI personality types express themselves through emojis🚀 --> Read More
University of Tsukuba, 2022-2024
University of Tsukuba, 2022-2024
University of Tsukuba, 2018-2021
Washington University in St. Louis, 2019-2020
University of Tsukuba, 2014-2018
Years Worked: Oct. 2024 - Current
I managed a team of six full-time employees and over ten part-time staff across sales, monitoring, and customer success departments. Additionally, I contributed to software development within the SaaS team, applying Large Language Models (LLMs) and other AI technologies to develop innovative solutions.
Years Worked: Feb. 2024 - Sep. 2024
I collaborated on the LLM-Creative project, focusing on understanding the company’s workflow and exploring improvements to the creative production process using LLMs for more efficient, reproducible outputs. Additionally, I researched how to use LLM tools for generating advertising images through Midjourney and explored video generation methods like Runway Gen-3. I also developed a customized ChatGPT model that generates catchy copywriting from URLs and utilized OpenAI’s API in Google Apps Script to automate customer and market data analysis, enhancing workflow efficiency.
Years Worked: July 2024 - Aug. 2024
I focused on network analysis of conspiracy groups on Telegram, creating bipartite networks to explore the interactions between groups and authors. I utilized tools like NetworkX and Gephi to calculate key metrics such as degree centrality, betweenness centrality, and PageRank. I categorized groups based on the number of authors and messages, analyzed shared authors between groups, and developed methods to calculate group closeness.
If you'd like to get in touch, feel free to email me at: caidanmeng@gmail.com