Citi Distributed Data Technologies Architect VP (Tampa FL or Irving TX) in Irving, Texas

  • Primary Location: United States,Texas,Irving

  • Other Location: United States,Florida,Tampa

  • Education: Bachelor's Degree

  • Job Function: Technology

  • Schedule: Full-time

  • Shift: Day Job

  • Employee Status: Regular

  • Travel Time: Yes, 10 % of the Time

  • Job ID: 16048300


Distributed Data Processing Platform Architect leads Distributed Data Servers and Parallel and Distributed Computing Projects:

  • Evaluates, Estimates, and Creates High Performance Distributed Computing and Distributed Data Management Architectures.

  • Leads Platform Development Projects: Design and Development of Data Sources Abstraction Connectors, Data Extraction and Ingestion Algorithms, Heterogeneous Data Processing and Integration Solutions.

  • Estimates Project Resources and Evaluates Cost/Time/Scope tradeoffs.

  • Creates Agile Project Plans

  • Leads Projects through SDLC

  • Delivers Projects and provides appropriate support.

  • Creates and delivers presentations for technical and business audiences.


Required Knowledge and Skills:

  • 7+ years’ experience in the following Areas:

  • Software Engineering and Programming Data Structures: Arrays, Lists, Queues, Heaps, Stacks, Trees, Graphs.

  • Algorithms: Sorting, Searching, Hashing, Parsing, Greedy

  • Algorithms, Graph Algorithms.

  • 10 years of expert-Level Java server-side software design and development.

  • Solid core Java and multithreading experience.

  • Strong knowledge of socket programming and proficient in TCP/IP and UDP-based network programming (sockets, datagrams and multicasting) in Java.

  • Solid understanding of the major design patterns.

  • Experience working with any Parser Generators like ANTLR

  • Experience writing any code generators based on your custom grammar.

  • Big Data experience is a big plus(Hive, Pig, Spark, etc.).

  • Any experience extending or contributing to open source frameworks is a big plus.

  • Data Processing

  • Data Models: Conceptual, Logical, Physical Relational Data Models. Components, Relation Types, Normal Forms and Normalization Procedures. Relational Data Models Design. Data Modeling Patterns: Super type-subtype, Star Schemas, Snowflakes, Dimensional Modeling. Denormalization procedures.

  • SQL Parsing, Optimization and Execution

  • Access paths, Filtering and Lookup Algorithms, Join Algorithms,

  • Data processing performance vehicles: Indexes and Indexing St

  • Materialized Views, Clusters, Caching, Parallel Processing

  • Partitioning Strategies.

  • Transactions: Definition, Properties, Isolation Levels. Locking Strategies

  • Distributed Transactions

  • Enterprise Level Database servers: Teradata, Oracle, DB2, Sybase Security and Data Access Management in Enterprise DB Servers.

  • Data Processing Architectures, Memory Architectures, Storage Architectures: shared memory, shared storage, shared nothing SMP and MMP

  • Extensible hashing, B+Trees, R+Trees.

  • Buffer Pool Management Strategies.

  • Data recovery strategies. Redo logs and Undo logs.

  • Parallel processing patterns: Pipelined and Partitoned DW Appliances: Netezza, Vertica, Greenplum, Exadata Column Oriented Databases

  • NoSQL architectures

  • Map Reduce Architectures.

  • Declarative Data Processing layers. Declarative Data Processing Languages and their translation to MapReduce or SQL.

  • Data Replication Architectures

  • Data Access Management Architectures

  • Data Encryption, Decryption, Masking Architectures. Data Quality Concepts Master Data Repositories Metadata Repositories.

Education BS or higher in Computer Science or Engineering or equivalent work experience

This position is local to Irving TX. Relocation is not budgeted for this position. Candidates from outside of the area must be able to relocate themselves. Relocation assistance will not be provided.