AI applied to critical workflows
I have experience with audit automation, contract reading, clause extraction, regulatory compliance and technical information retrieval using LLMs and multi-agent systems.
Hello, welcome. I am Flávia, a Senior Data Scientist with 6 years of experience in AI, data and automation. My work combines generative AI, NLP, machine learning and the full data process behind data science, from information structure and quality to modeling, automation and real-world application.
Executive Summary
I have experience with audit automation, contract reading, clause extraction, regulatory compliance and technical information retrieval using LLMs and multi-agent systems.
I work with Python, Spark, Databricks, Delta Lake, MLflow, Streamlit, LangChain and pipelines designed for governance, monitoring and production.
Alongside professional delivery, I keep strong technical production, recent Data Engineering certifications and academic research in Data Science.
Frameworks and Technologies
Experience
Role: Senior Data Scientist
Work with PySpark, Databricks and SQL in analytical flows, data quality, inconsistency analysis, automated refresh and applied AI for operational support.
Role: Senior Data Scientist / Senior AI Engineer
Development of solutions for PDF reading, rule extraction, financial-flow organization and automation with agents, workflows and analytical persistence.
Role: AI Specialist Data Scientist
Creation of LLM-based solutions for structured document extraction, retrieval, validation interfaces and pipeline evolution with observability.
Role: Data Scientist / AI Specialist
Work across machine learning, NLP, information extraction, text classification, validation interfaces and the evolution from heuristic pipelines to generative AI.
Role: Data Analyst
Analysis of large datasets, indicator construction, territorial monitoring, dashboards and analytical support for operational and policy-oriented reading.
Role: Data Science Intern
Automated content collection, NLP with spaCy, data visualization and experimental modeling and deep learning projects.
Areas of Practice
Development of solutions with LLMs, RAG, agents and structured workflows for extraction, analysis, automation and decision support in corporate contexts.
Experience with classification, supervised models, natural language processing, embeddings, deep learning and applications focused on text, documents and unstructured data.
Design of analytics environments, scalable pipelines, governance, data quality and modern architecture supporting analytics, machine learning and AI.
Exploratory analysis, dataset reconciliation, dashboards, monitoring and translation of data into clear insights for technical, operational and business teams.
Education and Certifications
Master’s in Applied Computing at UnB, MBA in Artificial Intelligence and Big Data at USP, and a Bachelor’s degree in Data Science and Artificial Intelligence from IESB.
Data Science research line focused on named entity recognition in legal texts, supported by a strong foundation in databases, large-scale data mining and applied experimentation.
Recent certifications in Airflow, Spark, Snowflake, BigQuery, Modern Data Stack and Databricks tracks connected to machine learning data preparation and retrieval agents.
Highlighted Certifications
Badge connected to machine learning tracks and modern data ecosystems.
Building retrieval agents and applied workflows inside the Databricks ecosystem.
Training focused on building data solutions and operations on enterprise platforms.
Additional foundation in AI application and workflows within the Palantir ecosystem.
Specialization in OpenAI Function Calling and LangChain tools for turning LLMs into operational agents.
End-to-end analytics workflow using BigQuery, R, SQL and Sheets.
Training in graph-based orchestration, observability, traceability and evaluation of LLM applications, strengthening AI engineering practices.
Academic Experience
In my undergraduate capstone, I developed two spaCy models fine-tuned for named entity recognition in Brazilian legal texts using the LeNER-Br dataset and published a working app on Hugging Face.
In my MBA thesis, I developed and validated a convolutional neural network model to classify chest X-ray images with a focus on supporting COVID-19 diagnosis.
Where to Find Me
Contact
I am always open to new opportunities.