Walmart logo

Staff Data Scientist

Walmart
2 days ago
Full-time
On-site
Sunnyvale, California, United States

What you'll do...

Position: Staff Data Scientist

Job Location: 1375 Crossman Ave, Sunnyvale, CA 94089

Duties: Build complex data sets from multiple data sources, both internally and externally. Conduct advanced statistical analysis to determine trends and significant data relationships. Build learning systems to analyze and filter continuous data flows and offline data analysis. Train algorithms to apply models to new data sets. Validate models and algorithmic techniques. Scale new algorithms to large data sets. Combine data features to determine search models. Research new techniques and best practices within the industry. Utilize system tools including (MySQL, Hadoop, Weka, R, Matlab,ILog). Develop multiple custom data models to drive innovative business solutions. Translate business needs into data requirements. Collaborate with cross-functional partners across the business. Interpret data to identify trends to go across future data sets. Collaborate with project teams to implement data modeling solutions. Develop models of current state in order to determine improvements needed. Responsible for analyzing large data sets to develop multiple custom models and algorithms to drive innovative business solutions. Work on large project teams in order to provide analytical support and guidance to an assigned are on for large projects (for example, email targeting, business optimization, consumer recommendations) within Walmart eCommerce. Responsible for building large data sets from multiple sources in order to build algorithms for predicting future data characteristics. Those algorithms will be tested, validated, and applied to large data sets. Responsible for training the algorithms so they can be applied to future data sets and provide the appropriate search results. Responsible for researching new trends in the industry and utilizing up-to-date technology (for example, HBase, MapReduce, LAPack, Gurobi) and analytical skills to support their assigned project. Act as the subject matter experts for statistical analysis and modeling for their project team.

Minimum education and experience required: Master’s degree or equivalent in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or related field and 2 years of experience in analytics related field; OR Bachelor’s degree or equivalent in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology or related field and 4 years of experience in analytics related field.

Skills required: Experience conducting independent and unbiased validation of machine learning models, including systematic and rigorous assessment of data quality, technical soundness, and model performance towards business objectives, data drifting, model stability, and model implementation accuracy. Experience with advanced machine learning algorithms, encompassing both supervised and unsupervised learning techniques, as well as recommender systems. Experience with natural language processing (NLP) techniques, including embedding models (Word2Vec, GloVe, and FastText) and transformer-based architectures (BERT and GPT). Experience with time series modeling techniques, including classical techniques (ARIMA) and neural network-based approaches (RNN and LSTM). Experience with statistical inference, experimental design, and hypothesis testing. Experience collaborating on and maintaining scalable data pipelines using distributed processing frameworks (Spark) and orchestration tools (Airflow) to support machine learning workflows and real-time feature updates. Experience with end-to-end development, deployment, and maintenance of AI/ML models in large-scale production environments within cloud-based infrastructures. Experience in ensuring the ethical use of AI models, including applying advanced techniques (including sampling and weighting) for mitigating bias and utilizing tools (including PDP, SHAP, and LIME) for promoting transparency and explainability in models. Experience developing or contributing to organization-level modeling standards to ensure alignment with industry best practices, reproducibility, risk mitigation, and consistency. Experience programming languages including Python (pandas, NumPy, scikit-learn, SciPy, stats models, TensorFlow, PyTorch, XGBoost) and SQL. Experience conducting and publishing peer-reviewed research involving innovation and application of established methods to extract insights in the field of data science. Employer will accept any amount of experience with the required skills.

Salary Range: $171,337/year to $286,000/year.  Additional compensation includes annual or quarterly performance incentives.

Benefits: At Walmart, we offer competitive pay as well as performance-based incentive awards and other great benefits for a happier mind, body, and wallet. Health benefits include medical, vision and dental coverage. Financial benefits include 401(k), stock purchase and company-paid life insurance. Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty and voting. Other benefits include short-term and long-term disability, education assistance with 100% company paid college degrees, company discounts, military service pay, adoption expense reimbursement, and more.

Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms. For information about benefits and eligibility, see One.Walmart.com.

Wal-Mart is an Equal Opportunity Employer.

#LI-DNI #LI-DNP

Walmart and its subsidiaries are committed to maintaining a drug-free workplace and has a no tolerance policy regarding the use of illegal drugs and alcohol on the job. This policy applies to all employees and aims to create a safe and productive work environment.