Diversity Supplement for Mining Social Media Big Data for Toxicovigilance
Project Number3R01DA057599-01S1
Former Number1R01DA057599-01
Contact PI/Project LeaderSARKER, ABEED H
Awardee OrganizationEMORY UNIVERSITY
Description
Abstract Text
Project Summary
The epidemic of substance use (SU) and substance use disorder (SUD) in the United States has been evolving
for decades. Both prescription and illicit drugs have been involved in overdose deaths over the years, with
notable increases in synthetic opioids (e.g., fentanyl & analogs) and psychostimulants (e.g., methamphetamine)
in recent years. The emergence of high-potency novel psychoactive substances (NPSs), such as fentanyl
analogs, have drastically contributed to rising deaths, and adversely impacted treatment engagement and
response. The COVID19 pandemic has further exacerbated the crisis, and recent studies have also highlighted
that substantial disparities exist in SUD treatment, research, interest, and response across different
subpopulations, with racial/ethnic minorities being disproportionately impacted. A key element to tackling the
crisis is improved surveillance. Specifically, there is a need for establishing novel approaches to provide timely
insights about the trends, distributions, and trajectories of the SUD epidemic, as traditional surveillance
approaches involve considerable lags. Many recent studies have identified social media (SM) as useful
resources for conducting SU/SUD surveillance. Many people use SM to discuss personal experiences, provide
advice, or seek answers to questions regarding SU/SUD, resulting in the generation of an abundance of
information. Such information can be characterized, aggregated and analyzed to obtain population- or
subpopulation-level insights, at low cost and in near real time. However, converting SM data into timely,
actionable knowledge is non-trivial since the data is big, complex, and noisy, requiring the development of
advanced, automated artificial intelligence methods. Funded by the National Institute on Drug Abuse, our past
work focused specifically on prescription medications (PM) and established the most sophisticated SM-based
data mining pipeline available to date. In the parent proposal, we are expanding our pipeline to attempt to solve
previously unaddressed problems including (i) detection of novel psychoactive substances—both prescription
and illicit, (ii) characterizing stigmatizing language, and studying long-term trends in the impact of substance use.
In the proposed supplement, we will focus on studying a specific substance that has emerged to be a national
problem—xylazine. The two specific aims of the supplement are as follows: (i) Characterize and quantify the
reported adverse effects and clinical and social impacts of xylazine, leveraging the state-of-the-art NLP and
machine learning methods and close-to-real-time big data from Twitter (X) and Reddit; and (ii) Analyze the impact
of xylazine over time across geographic locations and different population groups. The supplement provides
training on advanced data science, including machine learning, natural language processing (NLP), generative
AI and large language models (LLMs), and the applications of these areas in the broader substance use research
space. In addition to providing the opportunity to obtain expertise in such a unique domain, the supplement will
result in publicly available models made available via the HEAL initiative ecosystem.
Public Health Relevance Statement
Project Narrative
Current surveillance and reporting approaches for the epidemic of substance use (SU) and substance use
disorder (SUD) involve substantial lags, necessitating the need for the development of effective and innovative
surveillance strategies. Social media encapsulates an abundance of data about SU/SUD reported by people
from diverse subpopulations, making it possible to obtain insights into emerging substances that have limited
data available from traditional settings. In this supplement, we will utilize our existing data science infrastructure
(natural language processing and machine learning) to conduct detailed analyses of information on xylazine,
posted by people with lived experiences in social media platforms.
NIH Spending Category
No NIH Spending Category available.
Project Terms
AddressAdvanced DevelopmentAdverse effectsAdverse eventAreaArtificial IntelligenceBig DataBig Data to KnowledgeCOVID-19 pandemicCalibrationCessation of lifeClinicalCollaborationsComplexDataData ScienceDetectionDevelopmentDisparityDrug PrescriptionsDrug userEcosystemElementsEmergency MedicineEncapsulatedEpidemicExpert SystemsFentanylFundingFutureGenerationsGeographic LocationsHelping to End Addiction Long-termIllicit DrugsInformation RetrievalInfrastructureInterventionJournalsKnowledgeLanguageLived experienceLongitudinal StudiesLongitudinal trendsMachine LearningMentorshipMethamphetamineMethodsMiningModelingNational Institute of Drug AbuseNatural Language ProcessingOpioidOutcomeOutputPaperParentsPersonsPolicy MakingPopulationPopulation GroupProcessPublic HealthPublishingReportingResearchResearch PersonnelResourcesSocial ImpactsSourceStigmatizationStreamSubstance Use DisorderTarget PopulationsTimeTrainingTwitterUnited StatesWorkXylazineanalogartificial intelligence methodbiomedical informaticscohortcostdashboarddata accessdata analysis pipelinedata dashboarddata miningdata science infrastructuredata visualizationdeep neural networkempowermentethnic minorityexperiencefentanyl analoggenerative artificial intelligenceimprovedinnovationinsightinterestlarge language modelmachine learning methodnovelnovel strategiesopioid epidemicopioid useroverdose deathparent grantpsychostimulantracial minorityreal time monitoringresponseshot learningskillssocial mediasocial stigmasubstance usesurveillance strategysynthetic opioidtrend
No Sub Projects information available for 3R01DA057599-01S1
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 3R01DA057599-01S1
Patents
No Patents information available for 3R01DA057599-01S1
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 3R01DA057599-01S1
Clinical Studies
No Clinical Studies information available for 3R01DA057599-01S1
News and More
Related News Releases
No news release information available for 3R01DA057599-01S1
History
No Historical information available for 3R01DA057599-01S1
Similar Projects
No Similar Projects information available for 3R01DA057599-01S1