The School of Media and Journalism at the University of North Carolina at Chapel Hill seeks a data scientist to work with students, faculty and industry in the Reese News Lab. This is a full-time, 12-month non-faculty position that is funded by a three-year grant from the Knight Foundation.
This is an exciting opportunity to apply data analysis skills to media innovation and the challenge of getting the right information to the right people at the right time — especially information that citizens need to hold powerful people accountable and better understand the complex world we all share.
He/she needs to be able to communicate their findings, orally and visually, and know how to integrate multiple systems and data sets and be able to link and mash up distinctive data sets to discover new insights. This often requires connecting different types of data sets in different forms as well as being able to work with potentially incomplete data sources and cleaning data sets to be able to use them.
He/she needs to be able to program, preferably in different programming languages such as Python, R, or SQL.
In addition, they need to be familiar with disciplines such as:
Natural Language Processing: the interactions between computers and humans;
Machine learning: using computers to improve as well as develop algorithms;
Conceptual modelling: to be able to share and articulate modelling;
Statistical analysis: to understand and work around possible limitations in models;
Predictive modelling: most of the big data problems are towards being able to predict future outcomes; and
Hypothesis testing: being able to develop hypothesis and test them with careful experiments.
Education and Experience:
-Data engineering experience, including SQL and manipulating large structured or unstructured datasets for analysis. -Communicating complex data analysis to lay audiences. -Along with strong written and verbal communication skills, big data scientists should have the ability to: -Work in a fast-paced multidisciplinary environment. -Collect public data by scraping websites and using APIs. -Negotiate with local, state and federal officials for access to public datasets. -Clean dirty data, especially names and addresses, and ensuring that the limitations of the data are well understand by all stakeholders. -Perform exploratory data analysis with tools such as Python, R, and QGIS. -Create predictive models. -Perform natural language processing tasks, especially named entity recognition. -Store and query data with SQL and big data frameworks. -Provide data to students and faculty who will visualize it using tools such as D3. -Build a data sharing pipeline to make public data useful for various audiences -Work autonomously. -Preferred, but not required: teaching, training or mentoring experience.
The University of North Carolina at Chapel Hill is an equal opportunity and affirmative action employer. All qualified applicants will receive consideration for employment without regard to age, color, disability, gender, gender expression, gender identity, genetic information, national origin, race, religion, sex, sexual orientation, or status as a protected veteran.
About The University of North Carolina at Chapel Hill
Carolina rates among the nation’s great institutions of higher education, set on an historic and beautiful campus that celebrates all four seasons. Carolina’s students, faculty and staff come here from around the world, bringing varied cultural, racial and ethnic heritages that help make UNC-Chapel Hill a thriving intellectual center. We’re repeatedly ranked the nation’s best value in higher educa...tion for students seeking to earn a college degree – the University has garnered the top spot each time since the ranking’s launch in 1998.Whether you’ve just started your career, are new to academia or are a seasoned professional with assorted campus experiences to draw on, we hope you will find a great place at Carolina.