I tried out SyntheticUsers, so you don't have to
Using AI as a replacement for interviewing actual users is a brilliant idea if you want to look like you made an effort, but are really looking to fill the page with superficial, stereotyped info.
“Syntheticusers.com uses advanced AI and natural language processing to generate synthetic personas that can mimic real human behavior.”
I decided to test it out. First an overview of a recent study we conducted with 14 real humans in Oakland:
We worked with the Oakland Unified School District to understand what low income and immigrant parents need in the process of finding a school for their child. The paper titled “Not Another School Resource Map: Meeting Underserved Families’ Information Needs Requires Trusting Relationships and Personalized Care” was published in the ACM CSCW 2022 conference and received a recognition for contribution to Diversity and Inclusion.
Initially, we set out to study how the “Oakland school finder” dashboard can be made more accessible and usable for our target population. In the course of our work though, we learned that there was a mismatch between the information families need and the information that is readily available online.
We found that the parents in our sample were seeking a high quality education for their child, but faced challenges in reaching this goal, including finding useful and relevant information about the schools available, and making choices between schools that balanced access to well-resourced schools against other considerations like safety, inclusion, and convenience. Also, families face an overwhelming number of options, which exacerbates the challenge of finding relevant and helpful information. We found that a number of “parent advocates” — people who work at the school district, individual schools, or local non-profits — support parents during the enrollment process and play a big role in filling this information gap. Parent advocates provided relevant and concise information to families by first building a trusting relationship with them, learning about their personal circumstances and priorities, then connecting them with the right information or resources. There are over 100 schools in Oakland, so much of this work involved reducing information overload for families. In addition, this support often extended beyond enrollment to address other survival needs like housing, food, and healthcare.
So, overall, we found that the school dashboard was not providing the information that the participants needed, or in the words of one participant:
“it’s geared towards parents who are looking for certain things. It’s geared toward a middle-class, upper middle-class aesthetic”
And instead, parent advocates’ strategies focused on personal relationship building and trust and highlighted the importance of attending to individual families’ circumstances and priorities.
So… let’s see what SyntheticUsers would say about this problem. You can see the full interview transcripts here.
SyntheticUsers first asked me to define my users.
“Getting Started? We can help!
This is where you can add your audiences. Get specific, for instance: 34 year old worker in the fintech industry, married, with two dogs and a love of rugby.”
So I typed in: Low income, immigrant parent in Oakland who does not speak English.
Next, it asked me to define my problem: How do you find out information about which school to send your child to?
Next, it asked for a proposed solution to the problem: A dashboard that provides information about school locations, test scores, attendance statistics.
After clicking submit, a loading screen popped up that displayed the following sentences, one after the other:
Identifying potential interviews and checking availability … Scheduling interview … Conducting interview, taking detailed notes … thanking participants
Let’s set aside for a minute the ridiculousness of pretending that actual interviews are being scheduled and analyzed (sigh), I received back a report with 6 interviews. Of these 6, 4 said that they lived in Oakland, one lived in San Pedro Sula, Honduras, one in Karachi, Pakistan, and one in Tehran, Iran. Let’s also set aside the factual inaccuracies and focus on the substance of the interviews. I’ve summarized the main findings from the SyntheticUser “interviews” below:
Participants said that the dashboard would be “very effective,” “very useful,” and “very helpful” in solving their problems (6/6) and rated it on average 3.4/5.
Challenges to the dashboard idea raised by participants are:
limited internet access
not enough digital literacy
support for languages other than English
keeping information reliable and up to date
providing information beyond test scores such as extra curricular and safety records
Our own in-depth interviews over months with real parents found that the dashboard was a misguided solution and that what was really meeting our participants' information needs was through trusting relationships that met them where they were and went beyond information about schools. On the other hand, the main results from SyntheticUsers “interviews” are exactly what should be expected from a pattern synthesis engine1: stereotypically likely challenges for low income immigrant families that lack substance about their actual real lives. This is not a challenge that might be improved with more fine-tuning or better models. A pattern synthesis engine is fundamentally unaware of anything in the real world, and can not produce any "insights" beyond whatever patterns exist in its training data. Priming it by telling it to act as though it's someone else does not change this basic fact.
Finally, apart from the fact that misguiding users by acting as though a pattern synthesis engine can actually be “interviewed” is wrong, I’m also not sure why it’s even remotely useful. The whole point of spending the time to interview people and then spending a lot more time analyzing the large amounts of data gathered, is the ability to connect with them, build trust, dig deeper, ask them to share stories, and learn about their feelings and emotions. Pattern synthesis engines have none of those. They can generate a lot of text that make it look like they do, but there’s no substance there. In fact, basically the same information can be extracted from ChatGPT by simply asking it the question directly:
Imagine a low income, immigrant parent in Oakland who does not speak English. They are looking for information about which school to send their child to. A potential solution to this problem is a dashboard that provides information about school locations, test scores, attendance statistics. Create a list of challenges that the dashboard idea may face.
ChatGPT response (summarized here, full response below2):
Language barriers: Since the target audience does not speak English, the dashboard would need to be available in multiple languages, which could increase the development cost and maintenance of the platform.
Technology Access.
Understanding of the Data: The information provided in the dashboard may be overwhelming or confusing for parents who are not familiar with the school system in Oakland. Providing some guidance on how to interpret the data could help overcome this challenge.
So my main takeaways are, do not do this: it does not provide any substance about actual people’s lives beyond what is statistically most likely (see stereotyping), is wasting energy and producing unnecessary carbon, and is unethically misleading people to think that pattern synthesis engines can be scheduled for an interview and even thanked after.
“pattern synthesis engine” is a much more accurate term to describe the AI models we see today such as LLMs. I first read it here.
There are several challenges that a dashboard designed to provide information about schools to low-income, immigrant parents who do not speak English in Oakland may face. Some of these challenges include:
Language barriers: Since the target audience does not speak English, the dashboard would need to be available in multiple languages, which could increase the development cost and maintenance of the platform. Moreover, it may require professional translations to ensure that the information provided is accurate.
Technology Access: The target audience may not have access to technology or the internet. This could make it challenging for them to access the information provided on the dashboard, making the dashboard less effective.
Understanding of the Data: The information provided in the dashboard may be overwhelming or confusing for parents who are not familiar with the school system in Oakland. Providing some guidance on how to interpret the data could help overcome this challenge.
Data Accuracy: The accuracy of the data provided in the dashboard could be questioned, which may reduce the trust of parents in the system.
Overcrowding: Some schools in Oakland may be overcrowded, and the dashboard may not provide any information on this aspect. This could lead to parents enrolling their children in schools that cannot accommodate them, resulting in a waste of time and resources.
School ratings: School ratings based solely on standardized test scores can be misleading and may not provide a complete picture of a school's quality.
Availability of information: The information available on the dashboard may not be updated or may be incomplete, making it difficult for parents to make informed decisions.
Social Stigma: Some parents may feel hesitant to use the dashboard due to social stigma associated with receiving help or accessing public resources.
Addressing these challenges requires careful consideration of the user needs, adequate resources, and collaboration with community organizations to ensure that the dashboard is effective and accessible to the target audience.
I am backing Niloufar in this. It's not that she "did not use the tool correctly." It's that the tool is doing something inappropriate and unethical.
For design & strategic research we want to build new knowledge about people's approaches outside of what is already experienced or known by a team. To build this new knowledge we need to understand new meaning communicated to us by people who have done a lot of interior cognition (defined as: inner thinking, emotional reactions, personal rules) about the purpose they addressed. ChatGPT is not a source of meaning. ChatGPT is more like a search engine, finding words that have been published in relation to other words, all already existing in the training data. https://medium.com/@indiyoung/insta-personas-synthetic-users-fc6e9cd1c301
Hi Niloufar, thank you for taking the time to review our app.
We obviously disagree with your analysis and your conclusions.
Let's review.
From you post:
Parents were seeking a high quality education for their child, but faced challenges in reaching this goal but had several challenges:
1 - finding useful and relevant information about the schools available
2 - making choices between schools that balanced access to well-resourced schools against other considerations like safety, inclusion, and convenience
3 - overwhelming number of options
Some quotes from the synthetic user interviews you shared:
1 - "This issue is causing me anxiety and stress because I want to make the best decision for my child's education, but I don't know where to turn for reliable information."
" I have been struggling to find information about the different schools, their curriculums, and their reputations."
"This impacts me greatly because I want the best for my daughter, but I don't even know where to start looking for reliable information about schools."
2 - "Finally, I'm also concerned about the safety of the schools in my area, and whether my child will be able to stay safe while attending."
"Lastly, I am concerned about the safety of the schools in the area. There have been news reports of violence in some schools in the past, so I want to make sure the school I choose for my child is safe."
"Additionally, the dashboard may not reflect other factors that are important to me, such as the school's cultural responsiveness or climate. "
"Finally, this dashboard provides statistical data only, and the families need more in-depth information about schools such as school culture or individual teacher quality, etc. which can sometimes be more critical than just data."
"One issue is transportation. I live in a neighborhood with poor public transportation, which makes it difficult for me to visit schools that are located further away. "
"Secondly, can the dashboard also provide information about transportation options for the schools? This may be particularly important for parents who do not have access to a car."
"For example, the dashboard might indicate high test scores, but it might not provide information about the quality of the teachers, extracurricular programs, and other aspects that are important to me as a parent."
3 - "Another challenge is that there are many different types of schools to choose from, and I don't know which one would be the best for my child. For example, there are charter schools, public schools, and private schools, and they all have different strengths and weaknesses. "
"This solution provides a centralized location for me to access information about schools nearby without having to navigate multiple websites or online sources. "
Regarding the average score given by synthetic users ( 3.4/5), if fits your own results since, as you recognize in your paper, "...some parents in our sample found this kind of data useful...".
If you were willing to give synthetic users a chance and see how they really compare, you could have used the problem exploration mode of our app. Maybe that wasn't the goal.
But we did and we believe the results align with the results of your own research. Link: https://app.syntheticusers.com/summaries/f055d3c2-2a37-43a0-9b40-fcb3a3baa327
Now, we don't believe this will change yours or anyone's mind. People look at our product and, even before trying it, make a decision of giving it a chance or not.
We don't want user researchers, other product people or academics to stop talking to people. We want to help them be better equipped so, when they do, they can dig deeper and have even more nuanced conversations and explorations.
What we also want is to help people who are priced out of research or too time constraint to do any, to have a real chance at better understanding the people they are building products for. There are too many products and features launched with zero research and we want to help that number reduce.
Even well resourced teams can struggle to find the people they want to help as your paper clearly shows.
Two quotes from it show this really well:
"We met with these partners weekly to identify our research questions and develop a recruitment strategy over the course of **several months** ." (emphasis ours)
"Our sample in this paper is small because we spent **significant time and effort recruiting participants** and building relationships with community groups."
" responses to this survey were very slow, and most parents who completed it did not respond when we tried to set up an interview."
These are the type of problems we are trying to solve.
If you are available, we would welcome constructive feedback how to to make our product better