Position Title: Data Scientist
Clearance Type: Secret
Responsibilities (including but not limited to):
- Use Microsoft Visio to support continued development of the Pedigree of Analysis (PoA) Diagram which governs the collection and organization of data within the PoA tool (government-provided software application).
- Use Government designated visualization tools (e.g., Qlik Sense and/or Microsoft Power BI), to design and configure the PoA tool (software application) user interface to support intuitive navigation and data discovery, and allow understanding of presented arguments including argument completeness, credibility of supporting information, assumptions used, and status of missing information.
- Use Government-provided SharePoint libraries or designated relational database management system (RDBMS), to support the Capabilities Planning Database to ensure schema supports PoA data integration, analysis and visualizations. Work includes recommending changes to the database schema, participating in government review of recommended changes, and making database modifications.
- Provide data collection, research and analysis services to identify and record PoA data points from Capabilities Planning, Campaign of Learning (CoL), and Requirements Generation events and activities.
- Maintain briefing materials and conducting coordination and collaboration with Capability Portfolio Managers (CPMs) and Capability Integration Officers (CIOs) within CDD; CoL event and activity sponsors, facilitators, and participants; program managers and project officers; and any other stakeholders necessary to facilitate access to and understanding of unstructured and structured data related to the events and activities.
- Locate, obtain, and store source information/documents; extracting and structuring data from source information; and associating information to capability requirements, capability gaps, capability solutions, POM initiatives, and other items.
- Use Microsoft Excel, Government-provided SharePoint lists, designated RDBMS, and/or Databricks, to support integration of required structured data sources into the PoA tool (software application) within the Jupiter platform or other environment specified by the government.
- Conduct analysis of critical semi-structured data sources (e.g., the MCPRIME database) and using manual and automated methods, normalize or otherwise improve the utility of the data for use in the PoA tool.