Maximum size of call stack keeps varying in Clojure [duplicate]

This question already has answers here:
Why is the max recursion depth I can reach non-deterministic?
Why does the count of calls of a recursive method causing a StackOverflowError vary between program runs? [duplicate]
We wanted to check the max depth of the call stack on the JVM, so we ran the following:
(defn abcd
  "A function to find the max recursion allowed"
  [i]
  (try (abcd (inc i))
       (catch Throwable t
         (println "Failed at" i)
         (prn (class t)))))
I expected the value of i to be the same every time, because the stack size should always be the same. (I would be OK with small differences, but here the depth varies a lot.)
(abcd 0)
Failed at 4852
java.lang.StackOverflowError
(abcd 0)
Failed at 4917
java.lang.StackOverflowError
(abcd 0)
Failed at 23609
java.lang.StackOverflowError
(abcd 0)
Failed at 23620
java.lang.StackOverflowError
The depth was lower in the first runs and then stayed consistently in the 20k range. What optimisation would the JVM be doing here?
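One way to probe this is to run the same recursion in a thread with an explicit stack size. The following is a minimal sketch, not a definitive answer: max-depth and depth-in-thread are hypothetical names, and the stack size passed to the Thread constructor is only a hint that the JVM may round or ignore. A commonly cited explanation for the jump between runs is that the JIT compiler replaces interpreted frames with smaller compiled ones once the function becomes hot, so depths measured this way can still differ from run to run.

```clojure
;; Returns the depth at which the recursion overflowed the stack.
(defn max-depth [i]
  (try
    (max-depth (inc i))
    (catch StackOverflowError _ i)))

;; Runs max-depth in a fresh thread with the requested stack size (in bytes).
;; The size is a hint to the JVM, not a guarantee.
(defn depth-in-thread [stack-bytes]
  (let [result (promise)
        probe  (fn [] (deliver result (max-depth 0)))
        t      (Thread. nil ^Runnable probe "depth-probe" (long stack-bytes))]
    (.start t)
    (.join t)
    @result))

(println (depth-in-thread (* 16 1024 1024)))
```

Calling depth-in-thread several times in the same JVM should show whether the reported depth settles after the first few calls.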

Related

Getting error with Fibonacci Function in Clojure?

I am trying to print the value of the fibonacci number at position 400, using memoization. At first, I was only able to print the number until the 94th position, because I got an integer overflow error. After reading online, I am now able to print the value up to position 247 by converting into BigInt. But for some reason, when I try 248 or above, I get a huge error. Here is my code:
(def m-fib
  (memoize (fn [n]
             (condp = n
               0N 0N
               1N 1N
               (+ (m-fib (dec n)) (m-fib (- n 2)))))))
(println (m-fib 248N))
And here is a sample of the error I get (it's too long to put all of it into the question, and it doesn't specifically tell me what the problem is):
at clojure.lang.AFn.applyToHelper(AFn.java:154)
at clojure.lang.AFn.applyTo(AFn.java:144)
at clojure.core$apply.invokeStatic(core.clj:646)
at clojure.core$memoize$fn__5708.doInvoke(core.clj:6107)
at clojure.lang.RestFn.invoke(RestFn.java:408)
at fibonacci.core$_main$fn__28.invoke(core.clj:11)
at clojure.lang.AFn.applyToHelper(AFn.java:154)
at clojure.lang.AFn.applyTo(AFn.java:144)
at clojure.core$apply.invokeStatic(core.clj:646)
at clojure.core$memoize$fn__5708.doInvoke(core.clj:6107)
at clojure.lang.RestFn.invoke(RestFn.java:408)
at fibonacci.core$_main$fn__28.invoke(core.clj:11)
at clojure.lang.AFn.applyToHelper(AFn.java:154)
at clojure.lang.AFn.applyTo(AFn.java:144)
at clojure.core$apply.invokeStatic(core.clj:646)
at clojure.core$memoize$fn__5708.doInvoke(core.clj:6107)
at clojure.lang.RestFn.invoke(RestFn.java:408)
at fibonacci.core$_main$fn__28.invoke(core.clj:11)
at clojure.lang.AFn.applyToHelper(AFn.java:154)
at clojure.lang.AFn.applyTo(AFn.java:144)
at clojure.core$apply.invokeStatic(core.clj:646)
at clojure.core$memoize$fn__5708.doInvoke(core.clj:6107)
at clojure.lang.RestFn.invoke(RestFn.java:408)
at fibonacci.core$_main$fn__28.invoke(core.clj:11)
at clojure.lang.AFn.applyToHelper(AFn.java:154)
at clojure.lang.AFn.applyTo(AFn.java:144)
at clojure.core$apply.invokeStatic(core.clj:646)
at clojure.core$memoize$fn__5708.doInvoke(core.clj:6107)
at clojure.lang.RestFn.invoke(RestFn.java:408)
at fibonacci.core$_main$fn__28.invoke(core.clj:11)
at clojure.lang.AFn.applyToHelper(AFn.java:154)
at clojure.lang.AFn.applyTo(AFn.java:144)
How can I fix this and output the fib value at position 400 without any errors?
Your m-fib creates a tree that grows exponentially with the input.
Source: http://www.sicpdistilled.com/section/1.2.2/
The space used by m-fib grows so large that it overflows the stack (a StackOverflowError), since all intermediate values are kept on the stack before they are added.
Your function would work fine if it used tail recursion. As written, intermediate results are calculated at every call and put on the stack. With tail recursion you don't need that stack space, since only the running intermediate result is stored.
I didn't see a way to implement this in your structure, since it basically uses the stack to calculate the answer (it spreads out into a large number of base cases where n is 0 or 1). Here's a different, tail-recursive approach as an example.
Some languages (like Scheme) do automatic tail-call optimization. Clojure also has limited support for tail recursion (because of limitations of the JVM), as long as you use loop and recur (as also said in the answer by zabeltech).
In Clojure, recur can only be used in tail position (the last expression evaluated). This can, for example, be done by creating an iterative, tail-recursive function where we pass the intermediate results along:
;; m-fib-iter is defined first so that m-fib can refer to it
(def m-fib-iter
  (memoize
    (fn [a b count]
      (if (= count 0)
        b
        (recur (+' a b) a (- count 1)))))) ; note the +'
(defn m-fib [n]
  (m-fib-iter 1 0 n))
(m-fib 248) ;; => 3016128079338728432528443992613633888712980904400501N
Here a and b are updated every recurring round.
This answer is based on SICP Distilled, a Clojure text based on Structure and Interpretation of Computer Programs. I only added memoization, as you did, so that previously calculated answers are stored.
See that site for more information.
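For comparison, here is a minimal sketch of the same iteration written directly with the loop and recur forms mentioned above, without memoization. The name fib-iter is hypothetical, and the BigInt literals are an assumption made so that large results do not overflow.

```clojure
;; Iterative Fibonacci: a and b hold two consecutive Fibonacci numbers,
;; i counts down the remaining steps. recur reuses the current stack frame.
(defn fib-iter [n]
  (loop [a 0N, b 1N, i n]
    (if (zero? i)
      a
      (recur b (+ a b) (dec i)))))

(fib-iter 10)  ;; => 55N
(fib-iter 248) ;; => 3016128079338728432528443992613633888712980904400501N
```

Because recur does not grow the stack, the depth of the computation no longer depends on n.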
@zabeltech is right: you have reached the recursion limit.
In your case, the best thing you can do without changing your code is to call your fib function first with some values lower than your desired index, so that later calls hit memoized data instead of recursing all the way down:
user> (m-fib 100N)
354224848179261915075N
user> (m-fib 248N)
3016128079338728432528443992613633888712980904400501N
You can see that the 100th fib doesn't cause a stack overflow, and then the 248th doesn't either (because it doesn't recurse all the way down to 0; it only goes down to 100, and the values for 100 and below are already memoized).
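The same warm-up idea can be sketched as a loop that walks up to the target in steps. This is only an illustration: the step of 50 is an arbitrary value assumed to be small enough for the default stack, and m-fib is the memoized function from the question, repeated here so the snippet is self-contained.

```clojure
;; The memoized function from the question, unchanged.
(def m-fib
  (memoize (fn [n]
             (condp = n
               0N 0N
               1N 1N
               (+ (m-fib (dec n)) (m-fib (- n 2)))))))

;; Warm the cache in steps of 50, so that no single call has to recurse
;; more than about 50 levels before it reaches memoized values.
(doseq [i (range 50 401 50)]
  (m-fib (bigint i)))

(println (m-fib 400N))
```

The trade-off is that the caller has to know a safe step size, which depends on the JVM's stack settings.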
Also, Clojure has some much more elegant approaches to generating fibs, for example:
user> (def fibs (lazy-cat [0 1N] (map + fibs (rest fibs))))
#'user/fibs
user> (nth fibs 248)
3016128079338728432528443992613633888712980904400501N
I get a stack overflow error when I try to advance the memoization by a thousand or so (YMMV). This is because the recursion builds up the stack until it reaches the largest memoized argument -> value pair.
Shlomi's comment refers to solutions that yield a sequence. If we want to preserve constant-time access, we can build our own lazy vector. I've worked out a way to do this, whereby you supply an initial sequence of values and a function that generates the next value from however many preceding values.
You get a function that encloses a vector inside an atom, grows the vector to reach any offered argument, and never forgets what it has done.
The function-making function is:
(defn mem-vector [f inits]
  (let [mem (atom (vec inits))]
    (fn [n]
      (let [content- @mem, size- (count content-)]
        (loop [content content-, size size-]
          (if (> size n)
            (do
              (if (> size size-) (reset! mem content))
              (content n))
            (recur (conj content (apply f (subvec content (- size (count inits)))))
                   (inc size))))))))
To generate Fibonacci numbers, the initial values are 0 and 1, and the function that generates the next value from the previous two is +'.
So a lazy vector of all the Fibonacci numbers is
(def fibs (mem-vector +' [0 1]))
For example
(fibs 100000)

It ran out of heap space when I tried 1000000 :(.
A more readable example is ...
(map fibs (range 20))
;(0 1 1 2 3 5 8 13 21 34 55 89 144 233 377 610 987 1597 2584 4181)
The point is that this function gives you constant time access to an already found value. You don't have to run down a sequence.
Edited to correct an error whereby the buffer (correctly) expanded and (wrongly) contracted with each offered argument.
This is probably because you reached the maximum recursion depth. To get around this, one could rewrite the function to be tail recursive and then use the loop-recur pattern.

The usage of lazy-sequences in clojure

I am wondering whether lazy-seq returns a finite list or an infinite list. Here is an example:
(defn integers [n]
  (cons n (lazy-seq (integers (inc n)))))
When I run
(first (integers 10))
or
(take 5 (integers 10))
the results are 10 and (10 11 12 13 14). However, when I run
(integers 10)
the process cannot print anything and cannot continue. Can anyone tell me why, and explain the usage of lazy-seq? Thank you so much!
When you say that you are running
(integers 10)
what you're really doing is something like this:
user> (integers 10)
In other words, you're evaluating that form in a REPL (read-eval-print-loop).
The "read" step will convert from the string "(integers 10)" to the list (integers 10). Pretty straightforward.
The "eval" step will look up integers in the surrounding context, see that it is bound to a function, and evaluate that function with the parameter 10:
(cons 10 (lazy-seq (integers (inc 10))))
Since a lazy-seq isn't realized until it needs to be, simply evaluating this form will result in a clojure.lang.Cons object whose first element is 10 and whose rest element is a clojure.lang.LazySeq that hasn't been realized yet.
You can verify this with a simple def (no infinite hang):
user> (def my-integers (integers 10))
;=> #'user/my-integers
In the final "print" step, Clojure basically tries to convert the result of the form it just evaluated to a string, then print that string to the console. For a finite sequence, this is easy. It just keeps taking items from the sequence until there aren't any left, converts each item to a string, separates them by spaces, sticks some parentheses on the ends, and voilà:
user> (take 5 (integers 10))
;=> (10 11 12 13 14)
But as you've defined integers, there won't be a point at which there are no items left (well, at least until you get an integer overflow, but that could be remedied by using inc' instead of just inc). So Clojure is able to read and evaluate your input just fine, but it simply cannot print all the items of an infinite result.
When you try to print an unbounded lazy sequence, it will be completely realized, unless you limit *print-length*.
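As a small sketch of that last point, binding *print-length* caps how many elements the printer will realise. The integers function is the one from the question; the limit of 5 is an arbitrary choice.

```clojure
(defn integers [n]
  (cons n (lazy-seq (integers (inc n)))))

;; With *print-length* bound, the printer stops after 5 elements and
;; prints "..." in place of the rest of the sequence.
(binding [*print-length* 5]
  (prn (integers 10)))
;; prints (10 11 12 13 14 ...)
```

At a REPL, (set! *print-length* 5) should have the same effect for the rest of the session.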
The lazy-seq macro never constructs a list, finite or infinite. It constructs a clojure.lang.LazySeq object. This is a nominal sequence that wraps a function of no arguments (commonly called a thunk) that evaluates to the actual sequence when called; but it isn't called until it has to be, and that's the purpose of the mechanism: to delay evaluating the actual sequence.
So you can pass endless sequences around as evaluated LazySeq objects, provided you never realise them. Your evaluation at the REPL invokes realisation, an endless process.
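The delayed evaluation can be observed with realized?, which reports whether a LazySeq has run its thunk yet. This is a minimal sketch; pending is a hypothetical name, and the println is there only to make the moment of realisation visible.

```clojure
;; Nothing inside lazy-seq runs when the LazySeq object is created.
(def pending (lazy-seq (println "realising...") (list 1 2 3)))

(println (realized? pending)) ; false: the thunk has not been called yet
(println (first pending))     ; prints "realising..." and then 1
(println (realized? pending)) ; true: the result is now cached
```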
It's not returning anything because your integers function creates an infinite loop.
(defn integers [n]
  (do (prn n)
      (cons n (lazy-seq (integers (inc n))))))
Call it with (integers 10) and you'll see it counting forever.

"compiling" stacktrace error for clojure

I'm learning Clojure, and I find it difficult to understand where a specific compiler error happens:
java.lang.ClassCastException: java.lang.Long cannot be cast to
clojure.lang.IPersistentCollection, compiling:(fwpd/core.clj:100:1)
Line 100 is just:
(fib-seq3 5)
So it tells me nothing, because the error is actually in the fib-seq3 function (the parameters to a conj call are swapped, see below).
Is this normal? Is there no way to know where an error is?
Just for reference, here's the code (again, I know where the error is; I just don't understand how I was supposed to find it, given that the message doesn't tell me which line it happens on):
(defn fib-seq3
  ([to]
   (fib-seq3 [] 0 1 0 to))
  ([coll a b k to]
   (if (= k to)
     coll
     (fib-seq3 (conj b coll) b (+ a b) (inc k) to)))) ; the bug: should be (conj coll b)
(fib-seq3 5)
Stack traces in Clojure suck. In fact, the Clojure community rated error messages as the top-priority area for improvement, as well as the most frustrating part of Clojure.
This problem is not new. There have been no considerable improvements in Clojure stack traces for quite a long time. But the Clojure team is fully aware of the situation, so we can hope for improvements.
To better understand Clojure stack traces, try reading Clojure Stack Traces for the Uninitiated. Though the article is somewhat old, it's still relevant.
In short, you should look for the so-called "cause trace", which is the second part of any Clojure stack trace and starts with the phrase "Caused by".
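As a rough sketch of getting at the cause programmatically, clojure.repl provides root-cause and pst. The snippet below reproduces the swapped conj arguments from the question in isolation; the exact message text depends on the Java version, so no output is shown.

```clojure
(require '[clojure.repl :refer [pst root-cause]])

(try
  (conj 1 [])                            ; arguments swapped, as in the question
  (catch ClassCastException e
    ;; root-cause peels off wrapper exceptions to reach the original error
    (println (.getMessage (root-cause e)))
    ;; pst prints the exception with at most 5 stack frames (to *err*)
    (pst e 5)))
```

The first frames printed by pst point into clojure.core/conj, which is the hint that a conj call received its arguments in the wrong order.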
The problem was that I was using a REPL (Vim+Fireplace) to execute the code. Executing with lein repl fixed the problem.
@Leonid @amalloy:
(.printStackTrace *e)
gives the proper stacktrace in the REPL (even from inside Fireplace, using "cqp" which gives the REPL prompt), so thank you very much for the comment (didn't know that!)

(println (iterate inc 0)): why does this even start printing?

When I run (println (iterate inc 0)) in my repl, I will get something like this:
user=> (println (iterate inc 0))
(0 1 2 3 4 5 6 7 8 9 10 11 12 13 ....................
My expectation was that the REPL would show nothing and just hang, because (iterate inc 0) never ends. But I see (0 1 2 3 ....
(iterate inc 0) generates an infinite sequence, so it never finishes. If it never ends, why does println start printing values?
In other words, why does (println xx) start being evaluated even though its input never finishes being evaluated?
You should read up on lazy seqs in Clojure. They're able to produce values that can be consumed incrementally before the whole sequence is realized (which, in this case, will never happen).
It might help to think of it as push vs pull. Instead of iterate creating an entire list of values and then pushing them to the println function (which would never happen), iterate just hands it a lazy sequence, and println pulls values as it needs them. This is why (take 5 (iterate inc 0)) works; take only tries to pull 5 values before stopping.
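The pull model can be made visible with a lazy sequence that records each element as it is produced. This is a minimal sketch: noisy-ints and log are hypothetical names, and the atom exists only to observe which elements were realised.

```clojure
;; An endless sequence of integers that logs each element when it is realised.
(defn noisy-ints [n log]
  (lazy-seq
    (swap! log conj n)
    (cons n (noisy-ints (inc n) log))))

(let [log (atom [])]
  (doall (take 3 (noisy-ints 0 log))) ; the consumer pulls exactly 3 elements
  @log)
;; => [0 1 2]
```

Only the elements that the consumer asked for are ever computed, which is why printing can start before the sequence "finishes".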
Clojure's printing is smarter than System.out.println; it can be customized for different types. In the case of sequences, it walks through element-by-element, printing each one as it goes - we don't have to wait until the entire sequence is evaluated to start printing things.
By contrast, System.out.println, which calls toString before printing, behaves more like you might expect. It hangs forever, not printing anything because toString needs to evaluate the entire sequence - or, at least, it would hang forever if it didn't run out of memory trying to build the string.
That said, the entire expression is indeed stuck - if you were waiting for it to stop printing, you'd wait forever:
(do
  (println (iterate inc 0))
  (println "Never reached!"))

OutOfMemoryError when using seque function

I have this function that reproduces my problem:
(defn my-problem
  [preprocess count print-freq]
  (doseq [x (preprocess (range 0 count))]
    (when (= 0 (mod x print-freq))
      (println x))))
Everything works fine when I call it with the identity function, like this:
(my-problem identity 10000000 200000)
;it prints 200000, 400000 ... 9800000 just as it should
When I call it with the seque function, I get an OutOfMemoryError:
(my-problem #(seque 5 %) 10000000 200000)
;it prints numbers up to 2000000 and then it throws OutOfMemoryError
My understanding is that seque should just split the processing across two threads using a blocking queue with max size 5 (in this case). I don't understand where the memory leak is.
The way seque is implemented, if you consume elements much more quickly than you can produce them, a large number of agent tasks will pile up in the queue used internally by seque (up to one task per element in the sequence). In theory what you're doing should be fine, but in practice it doesn't really work out. You should be able to see the same effect just by running (dorun (seque (range))).
You can also use the function sequeue in flatland/useful, which makes tradeoffs that are different from the ones in clojure.core. Read the docstring carefully, but I think it would work well for your situation.