I am sure the first question arising in any sane person’s mind is why are we doing it? normally everyone goes the other way, we would want to make our cluster more secure and kerberize the cluster. And here I am, going the other way. It is due to change in how we want to use the clusters in the future.
- All of our clusters are no-more multi-tenant, so losing authorization was lesser of a concern. We no longer needed to secure data between applications.
- We used Redhat’s IPA as our Identity Provider and since we were moving to Centos, we wanted to get rid on IPA as well.
An important point to keep in mind is that, authorization does not work without authentication, for obvious reasons. So, removing kerberos means, we no longer can use Sentry as well for granting access to databases and table.
I have covered the changes made to cluster configuration through Cloudera manager, but I am sure same changes can be easily made using Ambari as well.
Hadoop Properties to change
Service | Property Name | Property Value |
---|---|---|
Zookeeper | enableSecurity (Enable Kerberos Authentication) |
False (uncheck) |
HDFS | hadoop.security.authentication |
Simple |
HDFS | hadoop.security.authorization |
False (uncheck) |
HDFS | dfs.datanode.address |
from 1004 (for Kerberos) to 50010 (default) |
HDFS | dfs.datanode.http.address |
from 1006 (for Kerberos) to 50075 (default) |
HDFS | Data Directory Permissions | from 700 to 755 |
Hbase | hbase.security.authentication |
Simple |
Hbase | hbase.security.authorization |
Simple |
Solr | Solr Secure Authentication | Simple |
Kafka | kerberos.auth.enable | False |
Hue | Kerberos Ticket Renewer | Stop and delete Role |
After removing kerberos authentication, to maintain the behavior in which all yarn container are executed as user submitting the yarn job and not as nobody
user, we have to make changes in below parameters.
Service | Property Name | Property Value |
---|---|---|
Yarn | yarn.nodemanager.linux-container-executor.nonsecure-mode.local-user |
yarn |
Yarn | yarn.nodemanager.linux-container-executor.nonsecure-mode.limit-users |
false |
In CDH 5.11 , second parameter needs to set in safety and NOT just unchecking the parameter in Cloudera Manager, just edit parameter YARN Service Advanced Configuration Snippet (Safety Valve)
for yarn-site.xml
As we are removing kerberos authentication, we have to get need to stop and delete Sentry which provides authorization. Before stopping and deleting the sentry service, make below changes in the configuration.
Service | Property Name | Property Value |
---|---|---|
Solr | Sentry Service | none |
Hive | Sentry Service | none |
Impala | Sentry Service | none |
Hue | Sentry Service | none |
Kafka | Sentry Service | none |
Clear yarn usercache - follow the steps detailed here in
Cloudera community article. Delete the all the usercache carefully on all the nodes and all directories, here is a sample command I used on the hosts with nodemanager roles rm -rf /mnt/dsk/*/yarn/nm/usercache/*'
Along with the Cluster changes, after disabling Kerberos, some code changes are also required. All users interacting with the cluster need to update the configs.
- Impala - impala-shell connection need to remove -k flag while connecting to impala
- Spark - in spark-submit, flag for
--keytab
and--principal
need to removed from the command - Oozie - Add config
user.name
to workflow config to make user actions are executed as the specified user.
There are more changes based on different access patterns to cluster, but most of the changes follow either of these two themes.
- Ignore or remove the parameters which passes the kerberos principal and keytab for authentication to the service.
- Since on a kerberos cluster, principal specifies the identification on user which is trying to access to cluster services, hence after removing the kerberos authentication, we need to add parameters like
user.name
or define the environment variableHADOOP_USER_NAME
So, this is my experience for un kerberising the hadoop cluster, please let me know if you find this helpful and share your experiences.