Ryan Tang, Developer in Durham, NC, United States
Ryan is available for hire
Hire Ryan

Ryan Tang

Verified Expert  in Engineering

Statistics Developer

Location
Durham, NC, United States
Toptal Member Since
January 21, 2022

Ryan是一名应用科学家,帮助企业在解决复杂问题时释放数据的全部潜力, complex business problems. For the past 8 years, he's been dedicated to building pragmatic, 数据驱动的解决方案,将科学的严谨性与实际的商业洞察力相结合. With experience spanning technology, real estate, and insurance industries, he's played a pivotal role in driving significant revenue growth, developing cutting-edge products, and optimizing business functions.

Portfolio

Various Hedge Funds
Python, QuantConnect,统计学,贝叶斯统计,统计建模...
Reddit, Inc.
Data Science, Distributed Systems, Software Engineering, Go, Scala, Python...
Duke University | Department of Statistics
Python, Algorithms, Machine Learning, Statistics, Bayesian Statistics...

Experience

Availability

Full-time

Preferred Environment

Visual Studio Code (VS Code), Jupyter Notebook, Python, Git, Data Wrangling

The most amazing...

...我开发的项目是一个统一的自动竞价算法和底层框架,它影响了Reddit 75%以上的广告收入.

Work Experience

Quantitative Strategy Research Consultant

2021 - PRESENT
Various Hedge Funds
  • Researched, designed, 对各类小型对冲基金实施中频统计套利量化策略.
  • Provided and promoted best practices on infrastructure, technology stacks, automated CI/CD, MLOps, and data literacy.
  • 指导客户使用新技术栈,并确保基础设施的持续维护.
  • Contributed to strategies traded on equities, options, futures, and forex, which have been consistently delivering a Sharpe ratio of 2+ since then.
Technologies: Python, QuantConnect,统计学,贝叶斯统计,统计建模, Backtesting Trading Strategies, Financial Modeling, Quantitative Finance, Trading, Data Wrangling

Senior Machine Learning Engineer

2022 - 2023
Reddit, Inc.
  • Led and contributed to Reddit's auto-bidding strategies. 致力于在分布式实时环境中设计和实现核心算法.
  • Contributed to the incremental improvements in revenue of 2.5%, budget utilization of 12%, and 30% clicks.
  • 在整个自动投标策略背后的算法和基础设施方面提供技术领导.
  • Took ownership of Maximize Clicks v2, Maximize Clicks v2.5, and Max Clicks Lagrangian.
  • Performed rigorous experiment design and statistical validation throughout.
  • Spearheaded distributed processing of over terabytes each day.
Technologies: Data Science, Distributed Systems, Software Engineering, Go, Scala, Python, Java, Spark, BigQuery, ETL, Mathematics, Quantitative Analysis, Numerical Analysis, Algorithms, Back-end Development, Machine Learning, Optimization, Statistics, Statistical Modeling, Bayesian Statistics, Bayesian Inference & Modeling, Real-time Streaming, Real-time Systems, Real-time Bidding (RTB), Experimental Design, Causal Inference, Reinforcement Learning, Docker, Amazon Web Services (AWS), GitHub, Advertising, Event-driven Programming, Time Series Analysis, Data Engineering, NumPy, Pandas, Data Analytics, Statistical Learning, Linear Programming, SQL, Data Visualization, Distributed Computing, Data Pipelines, Computational Advertising, Linear Algebra, Object-oriented Programming (OOP), Visual Studio Code (VS Code), Jupyter Notebook, Git, Scikit-learn, Data Modeling, Machine Learning Operations (MLOps), Backtesting Trading Strategies, Financial Modeling, Data Wrangling

Research Scientist

2021 - 2022
Duke University | Department of Statistics
  • 利用统计和机器学习知识开发新的方法,同时改进现有的最先进的方法.
  • Conducted research aligned with recent field developments and literature. 运用定性和定量分析和数据收集工具,在规定的时间内完成分配的任务.
  • 协助团队对MovieLens 25M数据集进行深度数据分析,从多个角度探索人们的电影评分行为.
  • 最终确定并向小组提交研究结果,并就具体主题提出建议. Accomplished a seven-page write-up, supporting the team a step closer to the goal of publishing a paper.
Technologies: Python, Algorithms, Machine Learning, Statistics, Bayesian Statistics, Recommendation Systems, Computational Advertising, Research, Mathematics, PostgreSQL, Data Science, NumPy, Pandas, SQL, Data Engineering, Quantitative Analysis, Distributed Systems, ETL, Numerical Analysis, Ads, Advertising, GitHub, Git, Data Analytics, Statistical Learning, Statistical Modeling, Experimental Design, Causal Inference, Reinforcement Learning, Software Engineering, Time Series Analysis, Linear Programming, Data Visualization, Data Pipelines, Linear Algebra, Object-oriented Programming (OOP), Visual Studio Code (VS Code), Jupyter Notebook, Bayesian Inference & Modeling, Scikit-learn, Data Wrangling

Principal

2015 - 2021
Ridge Equities
  • Spearheaded private equity fund operations, 通过系统化的市场运作和策略制定,优化单户增值租赁投资的运营效率.
  • Standardized business operations, value-add capital improvement projects, budget and timeline controls, trade coordination, and quality control assurance compliance with policies or regulations.
  • 通过指导总资产超过500万美元,扩大了商业机会, 利用管理和出色的沟通技巧,传达持续的年股本回报率超过15%.
  • Bolstered operations, revenue generation, 通过为费城地铁的33个单位制定创新的投资组合管理策略来扩大客户群.
  • Executed comprehensive property management, incorporating best practices in tenant screening, repair and maintenance, cost control, rent collection, dispute handling, and capital improvement to meet optimal equity and internal rate returns.
  • 促进利益相关者和跨职能团队之间的战略领导和沟通, 灌输公司愿景,以影响业务转型和实现目标.
Technologies: Python, Dashboards, Statistics, Machine Learning, Business Intelligence (BI), Asset Management, Equity Investment, Asset Valuation, Leadership, Property Management, Private Equity, Wealth Management, PostgreSQL, Dash, Quantitative Analysis, Algorithms, WebApp, Flask, Back-end Development, Data Science, Git, GitHub, Data Analytics, Statistical Learning, Statistical Modeling, Back-end, Pandas, NumPy, SQL, Data Engineering, Experimental Design, Causal Inference, Algorithmic Trading, Event-driven Programming, Numerical Analysis, Software Engineering, ETL, Time Series Analysis, Data Visualization, Data Pipelines, Linear Algebra, Object-oriented Programming (OOP), Mathematics, Visual Studio Code (VS Code), Jupyter Notebook, Scikit-learn, Financial Modeling, Data Wrangling

Senior Data Scientist

2016 - 2017
Guardian Insurance
  • 开发公司首个针对寿险购买者关键生活事件和行为驱动因素的客户细分模型, 利用广泛的统计建模和从各种来源的大量数据集中提取数据.
  • Achieved an average of 1.6 times target segment lifts, 降低获客成本,提高会话率,优化整体营销盈亏(P&L).
  • 通过在前景预测模型中引入具有额外关键行为特征的非线性,将AUC指标放大了8%以上.
Technologies: Python, Analytics, Business Intelligence (BI), Hadoop, Spark, Machine Learning, Customer Segmentation, Cross-selling, Upselling, Statistics, PostgreSQL, Oracle, PySpark, MapReduce, Data Pipelines, Distributed Computing, NumPy, Pandas, Data Engineering, SQL, Data Science, Distributed Systems, Software Engineering, BigQuery, ETL, Tableau, Quantitative Analysis, Numerical Analysis, Algorithms, Git, GitHub, Back-end, Amazon Web Services (AWS), Docker, Data Analytics, Statistical Learning, Statistical Modeling, MySQL, MongoDB, Causal Inference, Experimental Design, Event-driven Programming, Linear Programming, Data Visualization, Bayesian Statistics, Linear Algebra, Object-oriented Programming (OOP), Mathematics, Visual Studio Code (VS Code), Jupyter Notebook, Scikit-learn, Data Modeling, Machine Learning Operations (MLOps), Financial Modeling, Quantitative Finance, Data Wrangling

Business Analyst

2014 - 2016
Guardian Insurance
  • 通过数据解释和分析建立了丰富的交互式可视化,以集成多个数据源以支持性能分析, agency and producer ranking and awards, and internal marketing strategy.
  • Evaluated data collection processes for various business reports, utilizing multiple datasets to develop visual displays of solutions. 以书面和口头形式交流数据分析结果,以使演示更有效.
  • 通过更新最新的信息技术应用程序,制定策略性的商业智能解决方案. 使用Python、Tableau、Excel和VBA自动化80%以上的部门内部临时报告.
Technologies: Python, Statistics, Analytics, Business Intelligence (BI), Dashboards, Excel 365, Excel VBA, Tableau, PostgreSQL, Oracle, Data Visualization, Data Pipelines, Data Cleaning, Data Scraping, SQL, Data Engineering, NumPy, Pandas, Data Science, Quantitative Analysis, ETL, Algorithms, Numerical Analysis, Git, GitHub, Back-end, Data Analytics, Statistical Learning, Statistical Modeling, Software Engineering, Linear Algebra, Object-oriented Programming (OOP), Mathematics, Visual Studio Code (VS Code), Jupyter Notebook, Machine Learning, Scikit-learn, Financial Modeling, Quantitative Finance, Data Wrangling

Operation Research Consultant

2015 - 2015
Gemological Institute of America
  • 在一个供应链优化项目中管理超过三名专业人员,以简化内部质量控制物流系统.
  • 运用线性规划对物流系统进行了理论化,提出了生产实施的路线. 提供了一个关于Python和Django框架的全尺寸演示,重点是在线学习.
  • Formulated an operational strategy, mapped a value chain, and conducted quantitative research for prospective institute models.
Technologies: Python, Django, Operations Research, Linear Programming, Optimization, Research, Data Science, Data Engineering, SQL, MySQL, NumPy, Pandas, Machine Learning, Quantitative Analysis, Numerical Analysis, Algorithms, Back-end, Back-end Development, Git, GitHub, Data Analytics, Statistical Learning, Statistical Modeling, Software Engineering, ETL, Data Visualization, Linear Algebra, Object-oriented Programming (OOP), Statistics, Mathematics, Visual Studio Code (VS Code), Jupyter Notebook, Scikit-learn, Data Wrangling

Equity Investment Web App

这是一个由streamlite驱动的数据应用程序,用于股票价值投资研究. 该应用程序的最终目的是提供全面的基础数据,以做出明智的投资决策. It consists of the competitor analysis, debt and leverage analysis, operational efficiency, return on investment (ROI), return on equity (ROE), and cash flow.

Distributed Event-driven Backtesting System

我使用了一个python事件驱动的回溯测试系统来分析我的定量策略. It has a component that handles slippage and order executions, a portfolio manager that rebalances between multiple concurrent strategies, and an extensive backtesting analytics component for in-depth research.

Manhattan College Business Analytics Competition | First Place

http://manhattan.edu/news/archive/2015/05/first-annual-business-analytics-conference-and-competition-explores-art-and-science-decision
这些活动邀请了行业领袖,并为学习商业分析或相关领域的本科生提供了一个令人兴奋的机会,以测试他们的知识和发展他们的技能. 竞争的学生们参与决策的“艺术与科学”,同时练习他们通过创造性的方式对数据进行综合分析,得出商业见解的能力. My team and I, as a team lead, won first place in this competition.

Languages

Python, SQL, Scala, Excel VBA, Go, Java

Libraries/APIs

Pandas, NumPy, Scikit-learn, PySpark

Tools

Git, Tableau, BigQuery, GitHub

Paradigms

Object-oriented Programming (OOP), Unit Testing, Business Intelligence (BI), Distributed Computing, Linear Programming, Data Science, ETL, Event-driven Programming, Real-time Systems, Dynamic Programming, MapReduce

Platforms

Jupyter Notebook, Oracle, Docker, Visual Studio Code (VS Code), Amazon Web Services (AWS)

Storage

PostgreSQL, Data Pipelines, MySQL, MongoDB

Other

Operations Research, Mathematics, Statistics, Big Data, Analytics, Algorithms, Linear Algebra, Partial Differential Equations, Principal Component Analysis (PCA), Optimization, Stochastic Gradient Descent (SGD), Machine Learning, Bayesian Statistics, Recommendation Systems, Computational Advertising, Research, Dashboards, Asset Management, Equity Investment, Asset Valuation, Private Equity, Wealth Management, Customer Segmentation, Excel 365, Data Visualization, Data Cleaning, Statistical Learning, Data Analytics, Data Engineering, Financial Engineering, Competitor Analysis & Profiling, Time Series Analysis, Distributed Systems, Software Engineering, Quantitative Analysis, Numerical Analysis, Algorithmic Trading, Statistical Modeling, Reinforcement Learning, Bayesian Inference & Modeling, Experimental Design, Real-time Streaming, Real-time Bidding (RTB), Data Modeling, Machine Learning Operations (MLOps), QuantConnect, Backtesting Trading Strategies, Financial Modeling, Quantitative Finance, Trading, Data Wrangling, Graph Theory, Leadership, Property Management, Cross-selling, Upselling, Dash, Data Scraping, APIs, Ads, Advertising, Back-end, Causal Inference, Natural Language Processing (NLP), Signal Processing, Back-end Development, Game Development, Artificial Intelligence (AI), GPT, Generative Pre-trained Transformers (GPT)

Frameworks

Hadoop, Spark, Django, Streamlit, WebApp, Flask

2022 - 2023

Master's Degree in Statistical Science

Duke University - Durham, NC, United States

2011 - 2015

Bachelor's Degree in Business Analytics

Pace University - New York, NY, United States

JANUARY 2022 - PRESENT

Reinforcement Learning Specialization

Coursera

NOVEMBER 2021 - PRESENT

Fundamentals of Computing Specialization

Coursera

OCTOBER 2021 - PRESENT

Mathematics for Machine Learning Specialization

Coursera

Collaboration That Works

How to Work with Toptal

在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.

1

Share your needs

在与Toptal领域专家的电话中讨论您的需求并细化您的范围.
2

Choose your talent

在24小时内获得专业匹配人才的简短列表,以进行审查,面试和选择.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring