Questions tagged [aws-data-pipeline]

Use the [amazon-data-pipeline] tag instead

Simple service to transfer data between Amazon data storage services, kick off Elastic MapReduce jobs, and connect with outside data services.

67 questions
0 votes, 1 answer

Getting current AWS Data Pipeline status from Java

I am trying to access the current status of a data pipeline from the Java Data Pipeline client. My use case is to activate a pipeline and wait until it reaches the completed state. I tried the answer from this thread: AWS Data Pipeline - Components, Instances…
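The question asks about Java, but the flow is compact to show with boto3; the Java SDK's ActivatePipeline and DescribePipelines calls mirror it. A minimal polling sketch, with a hypothetical pipeline id:

```python
import time
import boto3

PIPELINE_ID = "df-0123456789ABCDEFGHIJ"  # hypothetical pipeline id

client = boto3.client("datapipeline")
client.activate_pipeline(pipelineId=PIPELINE_ID)

while True:
    resp = client.describe_pipelines(pipelineIds=[PIPELINE_ID])
    fields = resp["pipelineDescriptionList"][0]["fields"]
    state = next(f["stringValue"] for f in fields if f["key"] == "@pipelineState")
    if state == "FINISHED":  # you may also want to stop on error states
        break
    time.sleep(30)  # poll every 30 seconds
```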
0 votes, 1 answer

Duplication of data using Data Pipeline

I am trying to back up the DynamoDB data to S3 using AWS Data Pipeline, scheduled every 15 minutes in the pipeline settings. The template I used is the default one provided, i.e. "Export DynamoDB table to S3". The problem is…
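One way to keep frequent runs from overwriting or mixing with each other (if the output path does not already do this) is to stamp each run's output folder with the scheduled start time. A sketch with boto3 and hypothetical ids:

```python
import boto3

client = boto3.client("datapipeline")

# Hypothetical S3DataNode for the export output. The
# #{format(@scheduledStartTime, ...)} expression puts every 15-minute run
# into its own folder, keeping successive exports separate.
output_node = {
    "id": "S3BackupLocation",
    "name": "S3BackupLocation",
    "fields": [
        {"key": "type", "stringValue": "S3DataNode"},
        {
            "key": "directoryPath",
            "stringValue": "s3://my-backup-bucket/ddb-export/"
                           "#{format(@scheduledStartTime, 'YYYY-MM-dd-HH-mm')}",
        },
    ],
}

# put_pipeline_definition replaces the whole definition, so in practice this
# node is submitted together with the template's other objects.
client.put_pipeline_definition(
    pipelineId="df-0123456789ABCDEFGHIJ",  # hypothetical id
    pipelineObjects=[output_node],
)
```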
0 votes, 1 answer

Increase & decrease DynamoDB RCU from AWS Data Pipeline

I have an AWS DynamoDB table that is write-intensive. I've configured it in provisioned capacity mode with 10,000 WCU and 1,000 RCU. I'm using AWS Data Pipeline to export the DynamoDB contents to S3. The pipeline is configured with the read…
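A sketch of the throughput bump with boto3, which a ShellCommandActivity (or a wrapper script around the pipeline) could run before and after the export; the table name and capacity numbers are hypothetical:

```python
import boto3

dynamodb = boto3.client("dynamodb")
TABLE = "my-write-heavy-table"  # hypothetical table name

def set_capacity(rcu, wcu):
    """Adjust provisioned throughput; DynamoDB applies the change asynchronously."""
    dynamodb.update_table(
        TableName=TABLE,
        ProvisionedThroughput={"ReadCapacityUnits": rcu, "WriteCapacityUnits": wcu},
    )
    # The table_exists waiter polls DescribeTable until the table is ACTIVE again.
    dynamodb.get_waiter("table_exists").wait(TableName=TABLE)

set_capacity(rcu=5000, wcu=10000)   # raise RCU before the export runs
# ... run the Data Pipeline export ...
set_capacity(rcu=1000, wcu=10000)   # drop back to the normal level afterwards
```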
0 votes, 0 answers

Trigger an AWS Lambda function whenever a new file arrives on two different S3 prefixes

Every day we get one incremental file from each of multiple sources. Both sources place these files in two different S3 prefixes, but they arrive at different times. We want to process both files in one go and…
Krish
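One pattern for the question above is to let S3 notifications on both prefixes invoke the same Lambda, which proceeds only once both of the day's files are present. A sketch with hypothetical bucket, prefixes, and date-based file naming:

```python
import boto3
from datetime import datetime, timezone

s3 = boto3.client("s3")
BUCKET = "my-ingest-bucket"                               # hypothetical
PREFIXES = ("source-a/incoming/", "source-b/incoming/")   # the two prefixes

def handler(event, context):
    """Fires on S3 notifications from both prefixes, but only proceeds once
    today's file has landed under each of them."""
    today = datetime.now(timezone.utc).strftime("%Y-%m-%d")
    for prefix in PREFIXES:
        resp = s3.list_objects_v2(Bucket=BUCKET, Prefix=f"{prefix}{today}")
        if resp.get("KeyCount", 0) == 0:
            # The other file has not arrived yet; its own event will
            # invoke this handler again later.
            return
    process_both(today)

def process_both(day):
    # hypothetical downstream processing of the matched pair of files
    print(f"both incremental files for {day} are present; processing")
```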
0 votes, 0 answers

AWS ETL solutions for small data

My objective is to get the data from S3 files, transform it, and save it to a data source (could be DynamoDB or RDS). The file size would be <20 MB, and there could be multiple (~10) such files uploaded periodically (once a day). I'm considering using…
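For ~10 files under 20 MB arriving once a day, a Lambda triggered by S3 events is often sufficient, with no pipeline service needed. A minimal sketch with boto3; the target table, the id column, and the transformation are hypothetical:

```python
import csv
import io
import boto3

s3 = boto3.client("s3")
table = boto3.resource("dynamodb").Table("etl-target")  # hypothetical table

def handler(event, context):
    """Triggered by s3:ObjectCreated; transforms one small CSV into DynamoDB items."""
    record = event["Records"][0]["s3"]
    bucket, key = record["bucket"]["name"], record["object"]["key"]
    body = s3.get_object(Bucket=bucket, Key=key)["Body"].read().decode("utf-8")
    with table.batch_writer() as batch:
        for row in csv.DictReader(io.StringIO(body)):
            row["id"] = f"{key}:{row['id']}"   # hypothetical transformation
            batch.put_item(Item=row)
```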
0 votes, 1 answer

How to integrate GitHub with the Data Catalog in AWS Glue

This question is about the Data Catalog of AWS Glue. I want to build a process like this: connect GitHub to the AWS Glue Data Catalog -> pull request about the data catalog code (source) -> merge -> reflect the modified code in the AWS Glue Data Catalog ->…
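One way to approximate this flow is to version the table definitions in the GitHub repository and have CI push them into the catalog with the Glue API after a merge. A sketch with a hypothetical file layout and names:

```python
import json
import boto3

glue = boto3.client("glue")

# Hypothetical layout: each table definition lives in the repo as a JSON file
# matching Glue's TableInput shape, and CI runs this after a pull request merges.
with open("catalog/sales_db/orders.json") as f:
    table_input = json.load(f)

glue.update_table(DatabaseName="sales_db", TableInput=table_input)
```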
0 votes, 2 answers

Pass parameters to an AWS Data Pipeline built-in template from a Lambda function

I would like to create a data pipeline that would be invoked by a Lambda function. The pipeline is "Load S3 data into RDS MySQL", built using a template provided by AWS itself. From my Lambda function, I'm not able to define the parameters to…
Juhan
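For the question above, activate_pipeline accepts parameter overrides directly, so the Lambda can pass values without editing the template. A sketch; the pipeline id and parameter ids are hypothetical and must match the ids declared in the template's parameter objects:

```python
import boto3

datapipeline = boto3.client("datapipeline")

def handler(event, context):
    """Activates the template-based pipeline, overriding its parameters."""
    datapipeline.activate_pipeline(
        pipelineId="df-0123456789ABCDEFGHIJ",  # hypothetical pipeline id
        parameterValues=[
            {"id": "myInputS3Loc", "stringValue": event["s3_path"]},
            {"id": "myRDSTableName", "stringValue": event["table"]},
        ],
    )
```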
0 votes, 1 answer

Looking for a better way to visualize a data lake pipeline on AWS

I am building a data lake pipeline on AWS which includes many AWS services such as S3, CloudWatch, Lambda, Glue crawlers, Glue jobs, etc. The pipeline flow works like this: CloudWatch schedules a cron job to trigger a Lambda to fetch external data and save…
0 votes, 1 answer

Create a user with access to a view in Redshift

I'm pulling data from MySQL EC2 instances to S3 buckets, then creating views in Redshift. I want to create database users who can only query and see certain views created specifically for them in Redshift. I have example code below that I use to…
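A sketch of the grant pattern, run as an admin via psycopg2; connection details, user, schema, and view names are all hypothetical. The key point is granting USAGE on the schema plus SELECT on only the intended views; this controls what the user can query, though hiding other object names from the catalog entirely takes more work:

```python
import psycopg2

# Hypothetical connection details and object names.
conn = psycopg2.connect(
    host="my-cluster.abc123.us-east-1.redshift.amazonaws.com",
    port=5439, dbname="analytics", user="admin", password="admin-password",
)
conn.autocommit = True
cur = conn.cursor()

# The user can query only what it is granted: USAGE on the schema plus
# SELECT on the one view meant for it, and nothing else.
cur.execute("CREATE USER report_user PASSWORD 'Str0ngPassw0rd!'")
cur.execute("GRANT USAGE ON SCHEMA reports TO report_user")
cur.execute("GRANT SELECT ON reports.daily_sales_v TO report_user")
```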
0 votes, 2 answers

Copy data from PostgreSQL to S3 using AWS Data Pipeline

I am trying to copy all the tables from a schema (PostgreSQL, 50+ tables) to Amazon S3. What is the best way to do this? I am able to create 50 different copy activities, but is there a simple way to copy all the tables in a schema, or to write one pipeline…
Visss
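Rather than 50 hand-built copy activities, a single script can enumerate the schema's tables from pg_tables and export each one. A minimal sketch for the question above with psycopg2 and boto3; names are hypothetical, and each table is buffered in memory, so very large tables would need streaming instead:

```python
import io
import boto3
import psycopg2

conn = psycopg2.connect("host=mydb.example.com dbname=app user=etl password=secret")
s3 = boto3.client("s3")
BUCKET, SCHEMA = "my-export-bucket", "public"   # hypothetical names

cur = conn.cursor()
cur.execute("SELECT tablename FROM pg_tables WHERE schemaname = %s", (SCHEMA,))
tables = [row[0] for row in cur.fetchall()]

# One loop instead of one copy activity per table: dump each table as CSV
# and upload it under its own key.
for table in tables:
    buf = io.StringIO()
    cur.copy_expert(f'COPY {SCHEMA}."{table}" TO STDOUT WITH CSV HEADER', buf)
    s3.put_object(Bucket=BUCKET, Key=f"{SCHEMA}/{table}.csv",
                  Body=buf.getvalue().encode("utf-8"))
```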
0 votes, 1 answer

Has anyone used an AWS Systems Manager parameter in Data Pipeline to assign a value to a pipeline parameter?

"id": "myS3Bucket", "type": "String", "default": "\"aws ssm get-parameters --names variable --query \"Parameters[*].{myS3Bucket:Value}\"\"" I tried this , Where I created a variable in AWS parameter and was able to retrieve the value using this…
0 votes, 2 answers

AWS MySQL to GCP BigQuery data migration

I'm planning a data migration from AWS MySQL instances to GCP BigQuery. I don't want to migrate every MySQL database, because ultimately I want to create a data warehouse using BigQuery. Would exporting the AWS MySQL DB to S3 buckets as CSV/JSON/Avro, then…
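BigQuery loads natively from Google Cloud Storage rather than S3, so a common route is to stage the exported files in GCS first (or use a transfer service). A minimal sketch, assuming the MySQL data has already been dumped to CSV; all names are hypothetical:

```python
from google.cloud import bigquery, storage

gcs = storage.Client()
bq = bigquery.Client()

# Stage the exported CSV in GCS.
bucket = gcs.bucket("my-migration-bucket")
bucket.blob("exports/orders.csv").upload_from_filename("orders.csv")

# Load it into a BigQuery table.
job = bq.load_table_from_uri(
    "gs://my-migration-bucket/exports/orders.csv",
    "my_project.warehouse.orders",
    job_config=bigquery.LoadJobConfig(
        source_format=bigquery.SourceFormat.CSV,
        skip_leading_rows=1,       # header row
        autodetect=True,           # infer the schema from the data
    ),
)
job.result()  # wait for the load to finish
```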
0 votes, 2 answers

AWS Data Pipeline: dump data to 3 S3 nodes

I have a use case wherein I want to take data from DynamoDB and do some transformation on it. After this I want to create 3 CSV files (there will be 3 transformations on the same data) and dump them to 3 different S3 locations. My…
paramvir
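One way to do this without three separate pipelines is a single script (run, say, from a ShellCommandActivity or a Lambda) that scans the table once and fans the items out through three transformations. A sketch with hypothetical names; it assumes the items share the same attributes so each set of rows fits one CSV header:

```python
import csv
import io
import boto3

dynamodb = boto3.resource("dynamodb")
s3 = boto3.client("s3")

# Scan the source table once, following pagination.
table = dynamodb.Table("source-table")
resp = table.scan()
items = list(resp["Items"])
while "LastEvaluatedKey" in resp:
    resp = table.scan(ExclusiveStartKey=resp["LastEvaluatedKey"])
    items.extend(resp["Items"])

# Three hypothetical transformations, each paired with its own output key.
TRANSFORMS = {
    "exports/full.csv":  lambda it: it,
    "exports/ids.csv":   lambda it: {"id": it["id"]},
    "exports/audit.csv": lambda it: {"id": it["id"], "ts": str(it.get("updated_at"))},
}

for key, transform in TRANSFORMS.items():
    rows = [transform(it) for it in items]
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=rows[0].keys())
    writer.writeheader()
    writer.writerows(rows)
    s3.put_object(Bucket="my-export-bucket", Key=key, Body=buf.getvalue())
```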
0 votes, 1 answer

AWS Data Pipeline: insert status with SqlActivity

I am looking for a way to record the status of the pipeline in a DB table, assuming this is a very common use case. Is there any way I can record the status and time of completion of the complete pipeline, and the status and time of completion of…
PyRaider
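Besides a SqlActivity step inside the pipeline, one option for the question above is a small script (run from a final activity or an external scheduler) that reads the pipeline state and writes a row to a status table. A sketch with hypothetical connection details and table:

```python
from datetime import datetime, timezone
import boto3
import psycopg2

datapipeline = boto3.client("datapipeline")
PIPELINE_ID = "df-0123456789ABCDEFGHIJ"   # hypothetical

# Pull the pipeline's current state from its description fields.
resp = datapipeline.describe_pipelines(pipelineIds=[PIPELINE_ID])
fields = {f["key"]: f.get("stringValue")
          for f in resp["pipelineDescriptionList"][0]["fields"]}

# Record it; the connection string and table are hypothetical.
conn = psycopg2.connect("host=statusdb.example.com dbname=ops user=etl password=secret")
with conn, conn.cursor() as cur:
    cur.execute(
        "INSERT INTO pipeline_runs (pipeline_id, state, recorded_at) VALUES (%s, %s, %s)",
        (PIPELINE_ID, fields.get("@pipelineState"), datetime.now(timezone.utc)),
    )
```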
0 votes, 1 answer

How to give a user in one AWS account access to AWS Data Pipeline in another account?

I have two AWS accounts. I have a user in account A which needs full access to AWS Data Pipeline in account B. How do I achieve this? I have attached a policy to the user in account A granting access to Data Pipeline. But how do I attach a…
Njoi
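The usual cross-account pattern for the question above is a role in account B whose trust policy allows account A, which the user then assumes to call Data Pipeline with temporary credentials. A minimal sketch; the role name, account id, and permissions are all hypothetical:

```python
import boto3

# In account B, a role (e.g. DataPipelineCrossAccountRole) trusts account A
# and carries a permissions policy granting datapipeline:* . The user in
# account A assumes that role:
sts = boto3.client("sts")
creds = sts.assume_role(
    RoleArn="arn:aws:iam::222222222222:role/DataPipelineCrossAccountRole",  # hypothetical
    RoleSessionName="cross-account-pipeline",
)["Credentials"]

# A Data Pipeline client that acts inside account B.
datapipeline = boto3.client(
    "datapipeline",
    aws_access_key_id=creds["AccessKeyId"],
    aws_secret_access_key=creds["SecretAccessKey"],
    aws_session_token=creds["SessionToken"],
)
print(datapipeline.list_pipelines())
```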