Questions tagged with Amazon EMR

How should I configure my EMR cluster to handle large data?

I have an EMR cluster and have used the Treasure Data connector to read data from a table into a DataFrame using PySpark. The tables I'm trying to read have approximately 100 million to 500...

Amazon EMR

answers

votes

100

views

Nakshtra

asked 3 days ago

EMR Jupyter Notebook: PySpark Imports Work in Shell, Not in Notebook (Issue Importing Custom Files)

Issue: PySpark works in the first cells (likely SparkSession creation) but throws import errors when I use my own Python files in later cells. Environment: AWS EMR (Amazon EMR...

Amazon EMR

answers

votes

118

views

Harish

asked 9 days ago

Studio Workspace can't see my running EMR EC2 cluster to attach to

Let me know if this is something AWS EMR Studio does: 1. In Databricks Community Edition and in Google Colab, one can fire up a simple Jupyter notebook with an automatically started cluster (small...

Amazon WorkSpaces, Amazon EMR, Amazon EMR Serverless

answers

votes

137

views

ken cottrell

asked 15 days ago

AWS EMR - YARN Resource Issue

Hi everyone, I am using AWS EMR to run ETL operations on very large datasets (millions to billions of records). I am using PySpark and reading the CSV files using *spark.read.csv*. The results...

Amazon EMR, Compute

answers

votes

197

views

vsk95

asked 17 days ago

Serverless job failure

While running the serverless job, I am getting the error below: "Number of cores specified by 'spark.driver.cores '7' is invalid".

Amazon EMR, Amazon EMR Serverless

answers

votes

201

views

Akash

asked 19 days ago

refresh_hfiles not working

Hi, I have an EMR cluster with HBase in S3 storage mode. I have a read replica cluster pointing to the same S3 bucket. Now when I add a record in the primary cluster and flush the table on the primary, and then run refresh_hfiles...

Amazon EMR, Database, AWS IAM Identity Center, Amazon S3 Access Grants

answers

votes

226

views

shushant

asked 21 days ago

AWS EMR WAL creation error

Hi, I am getting an error while launching EMR with HBase in S3 storage mode and WAL backup enabled. Caused by: java.lang.RuntimeException: createWal failed for wal WALMetadata(WALWorkspace=testworkspace2,...

AWS Identity and Access Management, Developer Tools, Amazon EMR, IAM Policies

answers

votes

325

views

shushant

asked 21 days ago

I have a Python package saved in CodeCommit and I need it to run in the notebook linked to an EMR cluster.

I have a Python package saved in CodeCommit and need to use it in the notebook attached to my EMR cluster workspace. The package is already successfully installed via bootstrap. To do this, in my .sh...

AWS CodeCommit, Amazon EC2, Amazon EMR, Amazon EMR Studio

answers

votes

265

views

amanda_oliveira

asked a month ago

How do I connect Amazon MQ to AWS EMR Serverless?

I have an EMR Serverless application, and I am submitting a Spark job via a Python script. I have packaged all the dependencies and the script to an S3 bucket. When I execute the job the Spark job is...

Amazon EMR, Amazon MQ, Amazon EMR Serverless

answers

votes

296

views

Tushar

asked a month ago

Unable to run Iceberg insert in Hive deployed on EMR

Hello, I configured an Iceberg-formatted table with transactions in Hive on EMR 6.4.1. When I insert data into the table, the operation gets stuck without any error. Any insights are highly...

Accepted Answer

Amazon EMR

answers

votes

305

views

Mark

asked a month ago

JupyterHub version issue

I've started seeing the following error on JupyterHub on EMR `TypeError: required field "type_ignores" missing from Module` from the simplest commands ![the...

Amazon EMR

answers

votes

300

views

Dev

asked a month ago

No containers running but still instance not decommissioned

Hi Team, We have an EMR 6.10 cluster where Flink jobs are submitted to an existing application. In my case, a container was running on a task node. Then I resized the task instance group from 1 to 0 in task instance...

Accepted Answer

Amazon EMR

answers

votes

272

views

Scott M

asked 2 months ago
