Spring Batch – A Batch Framework Introduction

In this article we look at Spring Batch, a batch framework built on top of the Spring Framework. Before getting into Spring Batch, let us understand what batch programming is. A batch application performs a set of jobs without manual intervention. Take a typical shopping site that receives its product catalog feed as a flat file. We need to process the feed file and upload the data to the site's product catalog database. In this case we can write a set of tasks that read the feed file, transform the data and then upload it to the database behind the scenes. Spring Batch provides the infrastructure needed for this kind of batch processing. Now, let us go through the terminology used in Spring Batch.

Job: A job is a process that performs a group of tasks. For example, product catalog processing is a job. This job has tasks such as a basic product info import task, a product specifications import task, a price info import task, etc.

Step: A step is an individual task within a job. For example, importing price info is one step of the product catalog import job.

Item: An item is an individual entity. For example, a product is an item in the product catalog.

Chunk: A chunk is a group of items.

Job Repository: The job repository keeps track of job executions and their status. It holds information such as whether a job succeeded or failed, when a failure occurred, and from where the job has to restart.

ItemReader: The reader reads data from a data source (e.g. flat file, XML file, database).

ItemProcessor: The processor transforms the data before it is written to the destination. The ItemProcessor is optional in a Spring Batch job configuration.

ItemWriter: The writer writes the processed items to a data source (e.g. flat file, XML file, database).
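
To make the three roles above concrete, here is a minimal plain-Java sketch for the product feed example. It deliberately does not use the Spring Batch API: the `Reader`, `Processor` and `Writer` interfaces, the `Product` class, the `importFeed` helper and the `name,price` line format are all assumptions made for illustration, and the real Spring Batch interfaces differ in detail.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class ProductFeedSketch {

    // Hypothetical stand-ins for the three roles; these are NOT the Spring Batch interfaces.
    interface Reader<T> { T read(); }                 // returns null when the input is exhausted
    interface Processor<I, O> { O process(I item); }
    interface Writer<T> { void write(List<T> items); }

    // An "item" in the sense used above: one product from the catalog feed.
    static final class Product {
        final String name;
        final double price;
        Product(String name, double price) { this.name = name; this.price = price; }
    }

    // Reader: hands out one feed line at a time (an in-memory list stands in for the flat file).
    static Reader<String> lineReader(List<String> lines) {
        Iterator<String> it = lines.iterator();
        return () -> it.hasNext() ? it.next() : null;
    }

    // Processor: turns an assumed "name,price" line into a Product.
    static Processor<String, Product> lineToProduct() {
        return line -> {
            String[] parts = line.split(",");
            return new Product(parts[0].trim(), Double.parseDouble(parts[1].trim()));
        };
    }

    // Writer: appends to a list that stands in for the catalog database.
    static Writer<Product> listWriter(List<Product> target) {
        return target::addAll;
    }

    // Runs read -> process -> write over the feed, one item at a time, and returns what was "stored".
    static List<Product> importFeed(List<String> lines) {
        List<Product> database = new ArrayList<>();
        Reader<String> reader = lineReader(lines);
        Processor<String, Product> processor = lineToProduct();
        Writer<Product> writer = listWriter(database);
        String line;
        while ((line = reader.read()) != null) {
            writer.write(Arrays.asList(processor.process(line)));
        }
        return database;
    }

    public static void main(String[] args) {
        for (Product p : importFeed(Arrays.asList("Laptop, 999.50", "Mouse, 19.99"))) {
            System.out.println(p.name + " -> " + p.price);
        }
    }
}
```

Keeping the three roles behind separate interfaces is what lets the same job read from a file today and a database tomorrow without touching the transformation logic.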

The sequence diagram below depicts the interaction of ItemReader, ItemProcessor and ItemWriter.

[Figure: Spring Batch sequence diagram]

Implementations of the ItemReader, ItemProcessor and ItemWriter interfaces have to be provided to the Spring Batch job. A sample job configuration is given below.

<batch:job id="job1">
   <batch:step id="step1">
      <batch:tasklet transaction-manager="transactionManager">
         <batch:chunk reader="reader" writer="writer" processor="processor" commit-interval="1" />
      </batch:tasklet>
   </batch:step>
</batch:job>
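
The `commit-interval` attribute in the configuration above sets how many items make up one chunk: items are read and processed, and the accumulated chunk is then handed to the writer and committed as a unit. The following plain-Java sketch illustrates that idea only. It is not Spring Batch's implementation; the `runChunks` method and the use of `Iterator`, `Function` and `Consumer` in place of the reader, processor and writer are assumptions for illustration, and the transaction is represented by nothing more than a comment.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.function.Consumer;
import java.util.function.Function;

public class ChunkLoopSketch {

    // Hypothetical driver: reads and processes items one by one, writes them chunk by chunk.
    // Returns the number of writer calls, which equals the number of chunks.
    static <I, O> int runChunks(Iterator<I> reader,
                                Function<I, O> processor,
                                Consumer<List<O>> writer,
                                int commitInterval) {
        int chunks = 0;
        List<O> chunk = new ArrayList<>();
        while (reader.hasNext()) {
            chunk.add(processor.apply(reader.next()));
            if (chunk.size() == commitInterval) {
                writer.accept(chunk);         // a real framework would commit the transaction here
                chunks++;
                chunk = new ArrayList<>();
            }
        }
        if (!chunk.isEmpty()) {               // flush the final, possibly smaller, chunk
            writer.accept(chunk);
            chunks++;
        }
        return chunks;
    }

    public static void main(String[] args) {
        List<List<String>> written = new ArrayList<>();
        Function<String, String> upperCase = s -> s.toUpperCase();
        Consumer<List<String>> collect = written::add;
        int chunks = runChunks(
                Arrays.asList("a", "b", "c", "d", "e").iterator(),
                upperCase,
                collect,
                2);
        System.out.println(chunks + " chunks: " + written);
    }
}
```

With five items and an interval of 2, `main` prints `3 chunks: [[A, B], [C, D], [E]]`. With `commit-interval="1"`, as in the sample configuration, every item would be written in its own chunk.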

The features of Spring Batch are given below:

  • Transaction management
  • Chunk based processing
  • Declarative I/O
  • Start/Stop/Restart
  • Retry/Skip
  • Web based administration interface (Spring Batch Admin)

In the coming articles we will see sample Spring Batch applications. Until then, stay tuned.

I am Siva Prasad Rao Janapati, working as a software developer. I have hands-on experience with ATG Commerce (DAS/DPS/DCS), Mozu commerce, Broadleaf Commerce, Java, JEE, Spring, Play, JPA, Hibernate, Velocity, JMS, JBoss, WebLogic, Tomcat, Jetty, Apache, Apache Solr, Spring Batch, jQuery, NodeJS, SOAP, REST, MySQL, Oracle, MongoDB, Memcached, Hazelcast, Git, SVN, CVS, Ant, Maven, Gradle, Amazon Web Services, Rackspace, Quartz, JMeter, JUnit, OpenNLP, Facebook Graph, Twitter4J, YouTube GData, Bazaarvoice, Yotpo, 4-Tell, Alatest, Shopzilla and Linkshare, covering both open source and commercial technologies.
