I am sure the first question arising in any sane person’s mind is: why are we doing this? Normally everyone goes the other way: we want to make our cluster more secure, so we kerberize it. And here I am, going the opposite way. It is due to change...
[Read More]
Application Template for Spark Scala with Gradle
Sharing my tips for developing an application on a local desktop and automating the complete build process using Apache Spark with Scala
Apache Spark is a unified analytics engine for large-scale data processing. Although it supports writing applications in Java, Scala, Python, R, and SQL, Scala is preferred by many (especially by me) because it is the language the engine itself is written in. Furthermore, developing applications utilizing Spark on local desktop, automating build, testing and...
[Read More]
How to monitor network packet drops in a Cluster
Monitor network packet drops in a Cloudera cluster
Hadoop clusters, or for that matter any cluster that performs data processing, put a lot of pressure on the network infrastructure. So we need to understand, monitor, and tune the network performance of the cluster's nodes.
[Read More]
Use of jq in AWS CLI
jq is resourceful and helps with the operations where the CLI's query option falls short
[Read More]
RMAN-11003: failure during parse/execution of SQL statement: alter database recover logfile
Issues during RMAN restore/recovery
We got the error below while doing a backup-based RMAN duplicate.
The RMAN command for the duplicate is below.
[Read More]