cv
General Information
Full Name | Enrique Noriega-Atala |
Date of Birth | July 5th, 1987 |
Languages | English, Spanish |
Education
-
2020 PhD in Information
The University of Arizona, Tucson, Arizona - Minored in Statistics
-
2014 M.Sc. in Computer Science
The University of Arizona, Tucson, Arizona, US -
2010 B.S. in Information and Communication Technology
Tecnologico de Monterrey, Hermosillo, Sonora, Mexico - Minored in Software Engineering
Experience
-
2022-now Computational Sciences Researcher IV
Department of Computer Science, The University of Arizona, Tucson, Arizona - Co- developed and taught two workshops about AI and Machine Learning for The University of Arizona's DataLab
- Exploring the LLM Frontier
- AI Makerspace
- Co- lead the design and development of a university-wide private chatbot using open-source technologies. Responsible for RAG and LLM components, including dense passage retrieval indexing, prompt engineering, performance evaluation, LLM benchmarking, and API design.
- Co-Principal Investigator for Text Reading under DARPA’s ASKEM program. Led the development of a neuro-symbolic information extraction system for mathematical modeling from scientific literature. Coordinated a team of researchers to build a high-performance NLP system using Python and Scala. Project resulted in open-source software, one paper published at Findings of EMNLP 2024 and one manuscript under review.
- Developed an automated news aggregation dashboard for the AZHEALTHTXT project based on ReactJs-based application using PyTorch and HuggingFace transformers for filtering, classifying, summarizing, and translating news articles, aiding public health communication specialists.
- Designed and led the development of an LLM-powered chatbot for localized adverse weather and health-related resources under AZHEALTHTXT. Developed a functional RAG pipeline and user interface using Gradio, LangChain, and OpenAI API.
- Co- developed and taught two workshops about AI and Machine Learning for The University of Arizona's DataLab
-
2021-2022 Postdoctoral Research Associate
Department of Computer Science, The University of Arizona - Conducted biomedical NLP research under the Collaborative for Global Adaptive Pandemic Solutions initiative. Developed a visual analytics system for biomedical information extraction, published at the 16th IEEE Pacific Visualization Symposium.
- Co-authored five research publications and contributed to the REACH open-source project. Presented research at multiple conferences and collaborated on NSF and DARPA grant proposals and organized three research workshops and mentored graduate students in the Computer Science department.
-
2020 Sr. NLP Scientist
CondaMetrix LLC, Boston, MA - Developed NLP algorithms for electronic health records, including anatomy phrase classifiers and laterality named-entity recognizers. Trained and fine-tuned LSTM, GRU, and BERT models. Developed APIs to integrate models into the company's software ecosystem.
- Mentored interns and contributed to internal training curricula on NLP and machine learning.
-
2013-2019 Graduate Research and Teaching Associate
Department of Computer Science, The University of Arizona - Contributed to NLP and Machine Learning projects in the CLU lab and ML4AI, focusing on Information Extraction, Information Retrieval, and Reinforcement Learning, leading to multiple publications. Regular contributor to the REACH biomedical information extraction project.
- Served as Teaching Assistant for courses including Introduction to Machine Learning and Text Retrieval. Designed and graded assignments, implemented innovative grading methods, and provided support through substitute teaching, office hours, and exam proctoring.
- Courses:
- Computer Organization (2013)
- Data Structures (2014)
- Information Retrieval and Web Search (2015, 2016)
- Introduction to Machine Learning (2017, 2019)
-
2010-2012 Sr. Software Engineer
Fresh Software Concepts, Hermosillo, Sonora, Mexico - Worked on the development of an ERP system and multiple ancillary products for the produce industry based on the .NET stack.
- Lead the migration of multiple enterprise information systems into Microsoft’s Azure cloud from their previous on-premises location. Designed and coordinated the migration of software assets and data, as well as worked on software adaptations to make the systems work properly on their new cloud-based home.
- Participated on the design and architecture of multiple information systems based on the .NET stack.
-
2009-2010 Software Engineer
Teknol, S.A. de C.V., Hermosillo, Sonora, Mexico - Contributed to the development on GIS systems for multitouch hardware for the mining industry.
- Implemented multiple Web Services-based APIs for the company's products.
-
2008-2009 Intern
Internship at Centro de Investigación y Desarrollo de Ingeniería Avanzada, A.C., Hermosillo, Sonora, Mexico - Participated in the development of a web-based cattle management information system based on Python and Django.
-
2007-2008 Web Developer
Optima Commerce, Hermosillo, Sonora, Mexico - Contributed in the development of web-based international trade information system based on ASP.NET and C#.
-
2007 Jr. Software Engineer
Marketing Movil S.A de C.V., Hermosillo, Sonora, Mexico - Contributed in the development of a Java-based service for sending and transmitting text messages (SMS) for a local marketing agency.
Grants and Funding
-
2022-2024 SKEMA: Scientific Knowledge Extraction and Model Analysis, DARPA.
- co-Principal Investigator responsible for the design and development of a neuro-symbolic information extraction system for mathematical modeling from scientific literature.
-
2023-2024 Resilience Informatics for Public Health, Technology and Research Initiative Fund (TRIF), The University of Arizona.
- Key-personnel in charge of the design and development of a system for the automated aggregation, classification, summarization, and translation of news articles related to public health.
Patents
-
2024 Method and system for converting literature into a directed graph
- US Patent No. 12019981, co-inventor.
Open Source Projects
-
2014-now REACH
- Reach stands for Reading and Assembling Contextual and Holistic Mechanisms from Text. In plain English, Reach is an information extraction system for the biomedical domain, which aims to read scientific literature and extract cancer signaling pathways. Reach implements a fairly complete extraction pipeline, including: recognition of biochemical entities (proteins, chemicals, etc.), grounding them to known knowledge bases such as Uniprot, extraction of BioPAX-like interactions, e.g., phosphorylation, complex assembly, positive/negative regulations, and coreference resolution, for both entities and interactions.
Honors and Awards
-
2018 - The University of Arizona Graduate & Professional Student’s Council Travel Grant
- UofA School of Information’s Travel Award
- International Conference in Data Mining's Student Award
-
2014 - Galileo Circle Scholar, UofA’s College of Science
-
2013-2014 - Instituto Educativo Sonora-Arizona's scholarship
-
2012-2014 - CONACyT's graduate studies scholarship
-
2010 - CENEVAL’s outstanding performance testimony on the EGEL test