The Research Data Workflow Developer will support the implementation of the TigerData service, a comprehensive set of data storage and management tools and services that provide storage capacity, reliability, functionality, and performance to campus. In this role, the Research Data Workflow Developer will work with researchers across Princeton to understand and document active data processes and implement automated workflows for the acquisition/creation, storage, and movement of data to enable research. They will draw on their understanding of research data trends at Princeton to guide the evolution of the TigerData service and will be responsible for the scripting and development required to automate workflows using MediaFlux as TigerData grows. This role is also responsible for troubleshooting system-level issues related to monitoring and performance, as well as partnering with technical staff to upgrade and update MediaFlux as needed.
Design/Analysis
-Work with researchers across Princeton to understand and document active workflow processes that
support the efficient acquisition or creation, storage, and movement of data during the research life
cycle
-Analyze and document research data movement requirements between TigerData storage tiers;
understand trends in research, identify best practices, and document automation strategies to
support data movement for active research
-Advise faculty and researchers on data-intensive research and big data considerations
-Develop and maintain documentation related to research data storage and movement workflows
API/Script Development
-Understand the MediaFlux environment and API
-Using that API, design, develop, and maintain configurations to support the automation of research
data movement between storage tiers, both within TigerData and across multiple storage platforms
-Implement, maintain, and refine developed workflows configured to support data and storage
management for researchers
-Use scripting and programming tools (bash, Python, and Tcl) and perform basic debugging
-Contribute scripting to manage the MediaFlux environment, including maintenance, monitoring,
software updates, and data movement
Performance Tuning and Monitoring
-Work with colleagues to test, assess, and make recommendations for adjustments to hardware,
software, and network settings to increase the performance of data movement
-Increase efficiency of data movements by developing automation and tuning processes
-Identify and implement tools to assist with monitoring the performance of data movement
Professional Development
-Keep abreast of emerging technologies related to data storage and movement
Essential Qualifications:
Preferred Qualifications:
Princeton University is an Equal Opportunity/Affirmative Action Employer and all qualified applicants will receive consideration for employment without regard to age, race, color, religion, sex, sexual orientation, gender identity or expression, national origin, disability status, protected veteran status, or any other characteristic protected by law.