All Lines Technology is seeking a Big Data Architect. In this role the Big Data Architect will lead the architecture and design Data Platform in alignment with providing a unified data solution: aggregating customer and healthcare operational data, and providing easy access to powerful insights. As such, this position will be responsible for the technical and security architecture of the software applications, as well as the supporting infrastructure. This position will help drive quality, reliable, secure applications, using industry standard best practices.
PRIMARY DUTIES AND RESPONSIBILITIES
Lead a development team of big data designers, developers, data scientists, and DevOps
Implement a big data enterprise warehouse, BI and analytics system using Hive, Spark, RedShift, EMR (Hadoop), and S3
Develop and maintain processes to acquire, analyze, store, cleanse, and transform large datasets using tools like Spark, Kafka, Sqoop, Hive, NiFi, and MiNiFi
Provide recommendations, technical direction and leadership for the selection and incorporation of new technologies into the Hadoop ecosystem
Participate in regular status meetings to track progress, resolve issues, mitigate risks and escalate concerns in a timely manner
Contribute to the development, review, and maintenance of product requirements documents, technical design documents, and functional specifications
Help design innovative, customer-centric solutions based on deep knowledge of large-scale, data-driven technology and the healthcare industry
Help develop and maintain enterprise data standards, best practices, security policies and governance processes for the Hadoop ecosystem
Bachelor’s in computer information technology, computer science, management systems or related discipline required; Master’s preferred
Four-year degree in Computer Science/Software Engineering or related degree program, or equivalent application development, implementation and operations experience.
Advanced study or degrees such as Master’s degree in Business (MBA), Masters, PhD., in Computer Science/Software Engineering or a related scientific degree program is preferred
Minimum 5+ years of experience in large systems analysis and development, addressing unique issues of architecture and data management. Has the experience to work at the highest technical level of all phases of systems analysis and development activity, across the full scope of system development cycle
4+ years related experience on data warehousing and business intelligence projects
3+ years implementation or development experience with the Hadoop ecosystem
Working knowledge of the entire big data development stack
Experience handling very large data sets (10’s of terabytes and up preferred)
Experience with secure RESTful Web Services
Highly proficient with Java/Scala application development
Expert in Apache Spark infrastructure and development
Experience with Sqoop, Spark, Hive, Kafka, and Hadoop
Experience with automated testing for Big Data platforms
Experience with best practices for data integration, transformation, governance and data quality
Experience with developing, designing and coding, completing programming and documentation, and performing testing for complex ETL applications (Spark & Scala preferred)
Experience with Agile software development process and development best practices
Experience with Big Data text mining and big Data Analytics preferred
Understanding of Big Data Architecture along with tools being used on Hadoop ecosystem
Ability to lead tool suite selection and lead proofs of concepts
Ability to share ideas among a collaborative team and drive the team based on technical expertise and sharing best practices from past experience
Strong understanding and experience executing several software development methodologies and life cycles. Ability to understand and translate business requirements into technical specifications.
Ability to negotiate with and influence senior management. Ability to lead and influence across departments and across levels of leaderships both internally and with customers.
Proven ability to organize/manage multiple priorities coupled with the flexibility to quickly adapt to ever-changing business needs.
Excellent written and oral communication skills. Adept and presenting complex topics, influencing and executing with timely / actionable follow-through.
Strong analytical and problem-solving skills with the ability to convert information into practical training deliverables. Uses rigorous logic and methods to solve difficult problems.
Internal Number: KKTT126
About All Lines Technology
All Lines Technology is a leading provider of enterprise technology solutions and services, with proven expertise enabling organizations to overcome obstacles and achieve excellent business results.
All Lines Technology is a woman owned solutions provider that delivers cost effective, industry standard IT solutions to our customers. We strive to be a Professional Business Partner and Trusted Advisor with each of our clients. In doing so, we help companies streamline and improve the way they buy, implement, and manage their technology infrastructures that support their mission critical business applications. These scalable solutions deliver benefits to companies from start up to Fortune 500.
All Lines Technology specializes in industry standard solutions for Enterprise Infrastructure, Microsoft Collaboration and Productivity Solutions, MSP/Cloud Solutions, IT Staffing, IT Consulting, and 24/7/365 Help Desk. As a value-added solutions provider, we partner with only best-of breed industry leaders that meet the business needs of our clients. We use our expertise to deliver streamlined solutions customized to the unique needs of your business, and strive to be your true trusted ad...visor. By leveraging cutting edge technology deployment and management solutions from industry leaders, we ensure you receive the technology solutions you require, delivered with the highest level of service.
Our corporate headquarters is located in Warrendale, PA which includes a state of the art datacenter showcasing some of the latest technology hardware and software solutions for our customers as well as a Tier 3 datacenter for our cloud based services. We utilize these for live demos, proof of concepts, and application testing. In addition, we also have a 20,000 square foot warehouse, staging & integration center that we use for client management/imaging and warehousing. We have a secondary location in Cranberry Township, PA where our 24/7/365 Help Desk and network operations center resides.