Daily scala: traits

Showing posts with label traits. Show all posts

Tuesday, April 13, 2010

Creating Custom Traversable implementations

One of the most talked about features of Scala 2.8 is the improved Collections libraries. Creating your own implementation is trivial, however if you want your new collection to behave the same way as all the included libraries there are a few tips you need to be aware of.

Note: All of these examples can either be ran in the REPL or put in a file and ran

Starting with the simple implementation:

import scala.collection._
import scala.collection.generic._
class MyColl[A](seq : A*) extends Traversable[A] {
    // only abstract method in traversable is foreach... easy :) 
  def foreach[U](f: A => U) = util.Random.shuffle(seq.toSeq).foreach(f)
}

This is a silly collection I admit but it is custom :).

This example works but if you test the result of a map operation (or any other operation that returns a new instance of the collection) you will notice it is not an instance of MyColl. This is expected because unless otherwise defined Traversable will return a new instance of Traversable.

To demonstrate run the following tests:

val c = new MyColl(1, 2, 3)
println (c mkString ",")
println(c mkString ",")
println(c drop 1 mkString ",")
// this two next assertions fail (see following explanation)
assert(c.drop(1).isInstanceOf[MyColl[_]])
assert((c map {_ + 1}).isInstanceOf[MyColl[_]])

Both assertions will fail. The reason for these failures is because the collection is immutable which dictates by necessity that a new object must be returned from filter/map/etc... Since the Traversable trait returns instances of Traversable these two assertions fail. The easiest way to make these methods return an instance of MyColl is to make the following changes/additions.

import scala.collection._
import scala.collection.generic._
/*
Adding GenericTraversableTemplate will delegate the creation of new
collections to the companion object.  Adding the trait and
companion object causes all the new collections to be instances of MyColl
*/
class MyColl[A](seq : A*) extends Traversable[A] 
                             with GenericTraversableTemplate[A, MyColl] {
  override def companion = MyColl
  def foreach[U](f: A => U) = util.Random.shuffle(seq.toSeq).foreach(f)
}
// The TraversableFactory trait is required by GenericTraversableTemplate
object MyColl extends TraversableFactory[MyColl] {
/* 
If you look at the signatures of many methods in TraversableLike they have an
implicit parameter canBuildFrom.  This allows one to define how the returned collections
are built.  For example one could make a list's map method return a Set
In this case we define the default canBuildFrom for MyColl
*/
  implicit def canBuildFrom[A]: CanBuildFrom[Coll, A, MyColl[A]] = new GenericCanBuildFrom[A]
/*  
The method that builds the new collection.  This is a simple implementation
but it works.  There are other implementations to assist with implementation if
needed
*/
  def newBuilder[A] = new scala.collection.mutable.LazyBuilder[A,MyColl[A]] {
    def result = {
      val data = parts.foldLeft(List[A]()){(l,n) => l ++ n}
      new MyColl(data:_*)
    }
  }
}

Now instances of MyColl will be created by the various filter/map/etc... methods and that is fine as long as the new object is not required at compile-time. But suppose we added a method to the class and want that accessible after applying methods like map and filter.

Adding val o : MyColl[Long] = c map {_.toLong} to the assertions will cause a compilation error since statically the class returned is Traversable[Long]. The fix is easy.

All that needs to be done is to add with TraversableLike[A, MyColl[A]] to MyColl and we are golden. There may be other methods as well but this works and is simple.

Note that the order in which the traits are mixed in is important. TraversableLike[A, MyColl[A]] must be mixed in after Traversable[A]. The reason is that we want methods like map and drop to return instances of MyColl (statically as well as dynamically). If the order was reversed then those methods would return Traversable event though statically the actual instances would still be MyColl.

import scala.collection._
import scala.collection.generic._
class MyColl[A](seq : A*) extends Traversable[A]
                             with GenericTraversableTemplate[A, MyColl] 
                             with TraversableLike[A, MyColl[A]] {
  override def companion = MyColl
  def foreach[U](f: A => U) = util.Random.shuffle(seq.toSeq).foreach(f)
}
object MyColl extends TraversableFactory[MyColl] {  
  implicit def canBuildFrom[A]: CanBuildFrom[Coll, A, MyColl[A]] = new GenericCanBuildFrom[A]
  def newBuilder[A] = new scala.collection.mutable.LazyBuilder[A,MyColl[A]] {
    def result = {
      val data = parts.foldLeft(List[A]()){(l,n) => l ++ n}
      new MyColl(data:_*)
    }
  }
}

Now add in a new method to demonstrate that the new collection works as desired and we are done.

The following is the complete implementation with the tests. You can put it in a file and run scala <filename> or paste it into a REPL

import scala.collection._
import scala.collection.generic._
import scala.collection.mutable.{ Builder, ListBuffer }
class MyColl[A](seq : A*) extends Traversable[A]
                             with GenericTraversableTemplate[A, MyColl] 
                             with TraversableLike[A, MyColl[A]] {
  override def companion = MyColl
  def foreach[U](f: A => U) = util.Random.shuffle(seq.toSeq).foreach(f)
  def sayhi = println("hi!")
}
object MyColl extends TraversableFactory[MyColl] {  
  implicit def canBuildFrom[A]: CanBuildFrom[Coll, A, MyColl[A]] = new GenericCanBuildFrom[A]
  def newBuilder[A] = new ListBuffer[A] mapResult (x => new MyColl(x:_*))
}
val c = new MyColl(1, 2, 3)
println (c mkString ",")
println(c mkString ",")
assert(c.drop(1).isInstanceOf[MyColl[_]])
assert((c map {_ + 1}).isInstanceOf[MyColl[_]])
val o : MyColl[Int] = c filter {_ < 2}
println(o mkString "," )
o.sayhi

Monday, March 1, 2010

NullPointer when mixed traits (Warning)

This tip is mainly to document a 'GOTCHA' that I got caught by recently. It basically goes like this:

Trait Y extends(or has self-type) X. Trait X defines some abstract method 'm'. The initialization code in Y accesses 'm'. Creation of an object new X with Y results in: *Boom* NullPointerException (on object creation).

The example in code:

scala> trait X { val x : java.io.File }
defined trait X
scala> trait Y {self : X => ; val y = x.getName} 
defined trait Y
scala> new X with Y { val x = new java.io.File("hi")}
java.lang.NullPointerException
 at Y$class.$init$(< console>:5)
 at $anon$1.< init>(< console>:7)
 ...

At a glance it seems that x should override the abstract value x in trait X. However the order in which traits are declared is important. In this case first Y is configured then X. Since X is not yet configured Y throws an exception. There are several ways to work around this.
Option 1:

trait X {val x : java.io.File}
trait Y {self : X => ; val y = x.getName}
/*
Declaring Y with X will work because Y is initialized after X
but remember that there may
be other reasons that X with Y is required.  
Method resolution is one such reason
*/
new Y with X { val x = new java.io.File("hi")}

Option 2:

trait X { val x : java.io.File }
trait Y {self : X => ; def y = x.getName}
/*
Since method y is a 'def' x.getName will not be executed during initialization.
*/
scala> new X with Y { val x = new java.io.File("hi")}
res10: java.lang.Object with X with Y = $anon$1@7cb9e9a3

Option 3:

trait X { val x : java.io.File }
trait Y {self : X => ; lazy val y = x.getName}
/*
'lazy val' works for the same reason 'def' works: x.getName is not invoked during initialization
*/
scala> new X with Y { val x = new java.io.File("hi")}
res10: java.lang.Object with X with Y = $anon$1@7cb9e9a3

Option 4:

trait X {val x : java.io.File }
trait Y extends X {def y = x.getName}
/*
if Y extends X then a new Y can be instantiated
*/
new Y {val x = new java.io.File("hi")}

Two more warnings. First, the same error will occur whether 'x' is a def or a val or a var.

trait X { def x : java.io.File }   
trait Y {self : X => ; val y = x.getName}     
new X with Y { val x = new java.io.File("hi")}

Second warning: In complex domain models it is easy to have a case where Y extends X but the final object is created as: new X with Y{...}.

You will get the same error here because (I think) the compiler recognized that Y is being mixed in with X and therefore the X will be initialized as after Y instead of before Y.

First the code:

trait X { def x : java.io.File }   
trait Y extends X { val y = x.getName}        
new X with Y { val x = new java.io.File("hi")}

If the code instantiated new Y{...} the initialization would be X then Y. Because X can only be initialized once, the explicit declaration of new X with Y forces Y to be initialized before X. (X can only be initialized once even when it appears twice in the hierarchy).

This is a topic called linearization and will be addressed in the future.

Wednesday, September 9, 2009

Using objects to access trait functionality

Today's topic is based on an article by Bill Venners. http://www.artima.com/scalazine/articles/selfless_trait_pattern.html. I recommend reading that article as it goes into much more detail. I also recommend taking a look at the earlier topic that covers companion objects.

The normal way to use a trait is to mix it in to an object. However there can be a problem mixing two traits containing methods with equal signatures. If the two traits are not designed to work together then you will get a compile error. Otherwise one method will override the other. Either way you cannot access both methods. There is an additional way to access the functionality of a trait. You can create an object (not instance) that extends the trait and import the methods when you need them.

If the trait is stateless then the object can be shared if not then make sure that sharing the object is carefully handled.

Examples:

scala> trait T1 {
     | def talk = "hi"
     | }
defined trait T1
scala> trait T2 {
     | def talk = "hello"
     | }
defined trait T2
// Cannot extend C with T1 and T2 because they are not designed to work together
scala> class C extends T1 with T2
:6: error: error overriding method talk in trait T1 of type => java.lang.String;
 method talk in trait T2 of type => java.lang.String needs override modifier
       class C extends T1 with T2
             ^
scala> class C extends T1
defined class C
// objects can have state so becareful how you share them
scala> object Obj1 extends T1
defined module Obj1
scala> object Obj2 extends T2
defined module Obj2
// You can give aliases to the imported methods and use them in the class
scala> class C {
     | import Obj1.{talk => hi}
     | import Obj2.{talk => hello}
     | def sayHi = hi
     | def sayHello = hello
     | }
defined class C
scala> val c = new C
c: C = C@54d8fd1a
scala> c.sayHi
res0: java.lang.String = hi
scala> c.sayHello
res1: java.lang.String = hello
scala> class C extends T1 {
     | import Obj2.{talk => hello}
     | def helloTalk = hello
     | }
defined class C
scala> val c2 = new C
c2: C = C@2ee634bf
scala> c2.talk
res2: java.lang.String = hi
scala> c2.helloTalk
res5: java.lang.String = hello

Thursday, September 3, 2009

Adding methods using Traits

One useful application of a trait is the case where you want to add functionality to an existing class. In this example I have a class provided by a third party library (in this just a simple StringReader class from the Java library). But I want to be able to read lines as well as use the standard read methods.

One solution is to create a trait and when I instantiate the StringReader mix in the new trait. Code like new StringReader() with Lines results in a new class that extends StringReader and the trait Lines. As a result we have all the methods of StringReader and Lines. The biggest benefit is that we can define the trait to work with any Reader and then when we create the real instance we can mix it in to any class that extends Reader.

The other solution that can be used is to create an implicit conversion from StringReader to some other class. There are two draw backs here:

It is harder to tell what is happening
A trait can contain state but a "view" (which is what you get when one class is implicitly converted to another class) has no state it is just a view of the original class. In this example a view would work but in other examples like creating a pushback reader, it would not work.

Here is a simple example:

scala> trait Lines {
     | // the self type declares what type of class the trait can be applied to
     | // if there is no self type then it is assumed it can be applied to Any type
     | self:java.io.Reader =>
     | def nextLine:Option[String] = {
     | val builder = new scala.collection.mutable.StringBuilder()
     |
     | var next = read()
     |
     | while( next != -1 && next.toByte.toChar != '\n' ){
     | builder += next.toByte.toChar
     | next = read()
     | }
     |
     | if( builder.isEmpty ) None
     | else Some(builder.toString)
     | }
     | }
defined trait Lines
// Strings starting and ending with (""") are called raw strings.  All characters 
// within """ """ are automatically escaped.
// I am creating a reader and mixing in the Lines trait on construction
scala> val reader = new java.io.StringReader( """line one
     | line two""" with Lines
reader: java.io.StringReader with Lines = $anon$1@3850620f
scala> reader.nextLine
res0: Option[String] = Some(line one)
scala> reader.nextLine
res1: Option[String] = Some(line two)
scala> reader.nextLine
res2: Option[String] = None
scala> reader.nextLine
res3: Option[String] = None
// we can define a method that takes a reader with lines
scala> def toCollection( reader:java.io.StringReader with Lines) = {
     | def collect:List[String] = reader.nextLine match {
     |   case None => Nil
     |    // we do not need to worry about stack overflow
     |    // because of tail recursion.  This method cannot be
     |    // extended and collect is the last like in the collect
     |    // method so this method will be transformed into a loop
     |   case Some( line ) => line :: collect
     | }
     |
     | collect
     | }
toCollection: (reader: java.io.StringReader with Lines)List[String]
scala> toCollection( new java.io.StringReader( "line one\nlinetwo" ) with Lines).size
res8: Int = 2

Thursday, August 13, 2009

Traits and inheritance

Scala provides two structures for inheritance. Classes (abstract or not) and traits. Traits are very similar to Ruby Mixins meaning that they can contain code like abstract classes but like interfaces multiple traits can be inherited from.
Like interfaces traits cannot have constructors but in Scala variables can be abstract and therefore provide an easy way to simulate a constructor.
There are no method resolution conflicts because method definitions are always resolved right to left:
class X extends Y with A with B with C
If ABC and Y all have the method (doit) the method in C will be used. If C calls super.doit that will call B.doit and so on.
Note: If Y defines doit the A, B, and C must define doit with the override keyword:
override def doit() = {...}
In the above example ABC must be traits but Y can be a class or a trait. When inheriting you must always have one extends keyword which can optionally followed by one or more with clauses.

scala> abstract class Animal {
     |  val legs:Int
     | val noise:String
     | def makeNoise() = println(noise)
     | }
defined class Animal
scala>  trait Quadriped {
     | self:Animal =>
     | val legs = 4
     | }
defined trait Quadriped
scala> trait Biped {
     | self:Animal =>
     | val legs = 2
     | }
defined trait Biped
scala> class Dog extends Animal with Quadriped {
     | val noise = "Woof"
     | override def makeNoise() = println( noise+" "+noise)
     | }
defined class Dog
scala> new Dog().makeNoise()
Woof Woof
scala> abstract class GenericAnimal extends Animal{ 
     | val noise = "glup"                          
     | }
defined class GenericAnimal
scala> val quad = new GenericAnimal() with Quadriped
quad: GenericAnimal with Quadriped = $anon$1@10bfb545
scala> quad.makeNoise()
glup
scala> val biped = new GenericAnimal() with Biped
biped: GenericAnimal with Biped = $anon$1@7669521
scala> val biped = new GenericAnimal() with Biped{
     | override val noise = "Hello"
     | }
biped: GenericAnimal with Biped = $anon$1@6366ce5f
scala> biped.makeNoise()
Hello