Hbase indexer for SOLR

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
11 messages Options
Reply | Threaded
Open this post in threaded view
|

Hbase indexer for SOLR

F. T.
Hi all,
I want to extract fields from PDF files and store it into Hbase. Then I want to link the database with a Solr collection. To do this, I installed Hbase-indexer.
What is the best way to do it ?
Actually, I can write data into Hbase but not into my Solr collection. When I launch Hbase-indexer server and when I insert rows into my Hbase table, I get some errors :
Cannot connect to cluster at myIPaddress:2181/solr: cluster not found/not ready
Somebody to help me ? Thanks in advance.
Fred

Reply | Threaded
Open this post in threaded view
|

Re: Hbase indexer for SOLR

Sean Busbey-2
Hi Fred!

Unfortunately, that sounds like an issue on the Solr side.

The HBase community does not maintain any integration with solr, do
you know which hbase indexer project in particular you're using?

The most common one I've seen show up is the "Lily HBase Indexer" from
NGData[1]. If that's the one you're using, they have a user mailing
list[2] that can hopefully help sort things out.


[1]:  https://ngdata.github.io/hbase-indexer/
[2]: https://groups.google.com/forum/#!forum/hbase-indexer-user

On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> Hi all,
> I want to extract fields from PDF files and store it into Hbase. Then I want to link the database with a Solr collection. To do this, I installed Hbase-indexer.
> What is the best way to do it ?
> Actually, I can write data into Hbase but not into my Solr collection. When I launch Hbase-indexer server and when I insert rows into my Hbase table, I get some errors :
> Cannot connect to cluster at myIPaddress:2181/solr: cluster not found/not ready
> Somebody to help me ? Thanks in advance.
> Fred
>
Reply | Threaded
Open this post in threaded view
|

RE: Hbase indexer for SOLR

F. T.
Thanks for for your answer. That's it, I use the Lily Hbase Indexer. As told before, I can create/delete an indexer. I can launch Hbase-indexer server command. It's stable until I insert a row into Hbase.

Here is msg I get :


17/06/07 16:10:23 INFO mortbay.log: Started SelectChannelConnector@0.0.0.0:11060
17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port: 37592 with version info: version: "1.2.3" url: "git://kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision: "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon Aug 29 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client connection, connectString=MyIpAddress:2181/solr sessionTimeout=30000 watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection to server MyServer/MyIpAddress:2181. Will not attempt to authenticate using SASL (unknown error)
17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection established to MyServer/MyIpAddress:2181, initiating session
17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment complete on server MyServer/MyIpAddress:2181, sessionid = 0x15c81c312060085, negotiated timeout = 30000
17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085 closed
17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing event
java.lang.RuntimeException: org.apache.solr.common.SolrException: Cannot connect to cluster at MyIpAddress:2181/solr: cluster not found/not ready


Thanks again

Fred



________________________________
De : Sean Busbey <[hidden email]>
Envoyé : mercredi 7 juin 2017 16:30
À : [hidden email]
Objet : Re: Hbase indexer for SOLR

Hi Fred!

Unfortunately, that sounds like an issue on the Solr side.

The HBase community does not maintain any integration with solr, do
you know which hbase indexer project in particular you're using?

The most common one I've seen show up is the "Lily HBase Indexer" from
NGData[1]. If that's the one you're using, they have a user mailing
list[2] that can hopefully help sort things out.


[1]:  https://ngdata.github.io/hbase-indexer/

Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.io/hbase-indexer/>
ngdata.github.io
Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and easily search for any content stored in HBase. It allows you to quickly and easily index ...



[2]: https://groups.google.com/forum/#!forum/hbase-indexer-user

On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> Hi all,
> I want to extract fields from PDF files and store it into Hbase. Then I want to link the database with a Solr collection. To do this, I installed Hbase-indexer.
> What is the best way to do it ?
> Actually, I can write data into Hbase but not into my Solr collection. When I launch Hbase-indexer server and when I insert rows into my Hbase table, I get some errors :
> Cannot connect to cluster at myIPaddress:2181/solr: cluster not found/not ready
> Somebody to help me ? Thanks in advance.
> Fred
>
Reply | Threaded
Open this post in threaded view
|

Re: Hbase indexer for SOLR

Dima Spivak-2
The :2181/solr config looks suspect to me, but as Busbey points out,
questions of how to successfully set up Lily are probably better suited for
their user mailing list.

On Wed, Jun 7, 2017 at 7:41 AM F. T. <[hidden email]> wrote:

> Thanks for for your answer. That's it, I use the Lily Hbase Indexer. As
> told before, I can create/delete an indexer. I can launch Hbase-indexer
> server command. It's stable until I insert a row into Hbase.
>
> Here is msg I get :
>
>
> 17/06/07 16:10:23 INFO mortbay.log: Started
> SelectChannelConnector@0.0.0.0:11060
> 17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port:
> 37592 with version info: version: "1.2.3" url: "git://
> kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision:
> "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon Aug 29
> 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client connection,
> connectString=MyIpAddress:2181/solr sessionTimeout=30000
> watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection to
> server MyServer/MyIpAddress:2181. Will not attempt to authenticate using
> SASL (unknown error)
> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection established
> to MyServer/MyIpAddress:2181, initiating session
> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment
> complete on server MyServer/MyIpAddress:2181, sessionid =
> 0x15c81c312060085, negotiated timeout = 30000
> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085
> closed
> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
> 17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing event
> java.lang.RuntimeException: org.apache.solr.common.SolrException: Cannot
> connect to cluster at MyIpAddress:2181/solr: cluster not found/not ready
>
>
> Thanks again
>
> Fred
>
>
>
> ________________________________
> De : Sean Busbey <[hidden email]>
> Envoyé : mercredi 7 juin 2017 16:30
> À : [hidden email]
> Objet : Re: Hbase indexer for SOLR
>
> Hi Fred!
>
> Unfortunately, that sounds like an issue on the Solr side.
>
> The HBase community does not maintain any integration with solr, do
> you know which hbase indexer project in particular you're using?
>
> The most common one I've seen show up is the "Lily HBase Indexer" from
> NGData[1]. If that's the one you're using, they have a user mailing
> list[2] that can hopefully help sort things out.
>
>
> [1]:  https://ngdata.github.io/hbase-indexer/
>
> Lily HBase Indexer by NGDATA - GitHub Pages<
> https://ngdata.github.io/hbase-indexer/>
> ngdata.github.io
> Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and
> easily search for any content stored in HBase. It allows you to quickly and
> easily index ...
>
>
>
> [2]: https://groups.google.com/forum/#!forum/hbase-indexer-user
>
> On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> > Hi all,
> > I want to extract fields from PDF files and store it into Hbase. Then I
> want to link the database with a Solr collection. To do this, I installed
> Hbase-indexer.
> > What is the best way to do it ?
> > Actually, I can write data into Hbase but not into my Solr collection.
> When I launch Hbase-indexer server and when I insert rows into my Hbase
> table, I get some errors :
> > Cannot connect to cluster at myIPaddress:2181/solr: cluster not
> found/not ready
> > Somebody to help me ? Thanks in advance.
> > Fred
> >
>
--
-Dima
Reply | Threaded
Open this post in threaded view
|

Re: Hbase indexer for SOLR

Steen Manniche
It looks like it is trying to connect to a ZooKeeper instance. Have you set
up your environment according to the instructions on the lily website?

Br,

Den 7. jun. 2017 4.54 PM skrev "Dima Spivak" <[hidden email]>:

> The :2181/solr config looks suspect to me, but as Busbey points out,
> questions of how to successfully set up Lily are probably better suited for
> their user mailing list.
>
> On Wed, Jun 7, 2017 at 7:41 AM F. T. <[hidden email]> wrote:
>
> > Thanks for for your answer. That's it, I use the Lily Hbase Indexer. As
> > told before, I can create/delete an indexer. I can launch Hbase-indexer
> > server command. It's stable until I insert a row into Hbase.
> >
> > Here is msg I get :
> >
> >
> > 17/06/07 16:10:23 INFO mortbay.log: Started
> > SelectChannelConnector@0.0.0.0:11060
> > 17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port:
> > 37592 with version info: version: "1.2.3" url: "git://
> > kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision:
> > "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon Aug
> 29
> > 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
> > 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client connection,
> > connectString=MyIpAddress:2181/solr sessionTimeout=30000
> > watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
> > 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection to
> > server MyServer/MyIpAddress:2181. Will not attempt to authenticate using
> > SASL (unknown error)
> > 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection
> established
> > to MyServer/MyIpAddress:2181, initiating session
> > 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment
> > complete on server MyServer/MyIpAddress:2181, sessionid =
> > 0x15c81c312060085, negotiated timeout = 30000
> > 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085
> > closed
> > 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
> > 17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing
> event
> > java.lang.RuntimeException: org.apache.solr.common.SolrException: Cannot
> > connect to cluster at MyIpAddress:2181/solr: cluster not found/not ready
> >
> >
> > Thanks again
> >
> > Fred
> >
> >
> >
> > ________________________________
> > De : Sean Busbey <[hidden email]>
> > Envoyé : mercredi 7 juin 2017 16:30
> > À : [hidden email]
> > Objet : Re: Hbase indexer for SOLR
> >
> > Hi Fred!
> >
> > Unfortunately, that sounds like an issue on the Solr side.
> >
> > The HBase community does not maintain any integration with solr, do
> > you know which hbase indexer project in particular you're using?
> >
> > The most common one I've seen show up is the "Lily HBase Indexer" from
> > NGData[1]. If that's the one you're using, they have a user mailing
> > list[2] that can hopefully help sort things out.
> >
> >
> > [1]:  https://ngdata.github.io/hbase-indexer/
> >
> > Lily HBase Indexer by NGDATA - GitHub Pages<
> > https://ngdata.github.io/hbase-indexer/>
> > ngdata.github.io
> > Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> and
> > easily search for any content stored in HBase. It allows you to quickly
> and
> > easily index ...
> >
> >
> >
> > [2]: https://groups.google.com/forum/#!forum/hbase-indexer-user
> >
> > On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> > > Hi all,
> > > I want to extract fields from PDF files and store it into Hbase. Then I
> > want to link the database with a Solr collection. To do this, I installed
> > Hbase-indexer.
> > > What is the best way to do it ?
> > > Actually, I can write data into Hbase but not into my Solr collection.
> > When I launch Hbase-indexer server and when I insert rows into my Hbase
> > table, I get some errors :
> > > Cannot connect to cluster at myIPaddress:2181/solr: cluster not
> > found/not ready
> > > Somebody to help me ? Thanks in advance.
> > > Fred
> > >
> >
> --
> -Dima
>
Reply | Threaded
Open this post in threaded view
|

RE: Hbase indexer for SOLR

F. T.
In reply to this post by Dima Spivak-2
Hi Dima,

I didn't find any help on their user mailing list about this problem. And I don't know where the :2181/solr comes from.

Here is my "hbase-indexer-site.xml" configuration file :


<property>

<name>hbase.zookeeper.quorum</name>

<value>MyIpAddress</value>

</property>

 

<property>

<name>hbaseindexer.zookeeper.connectstring</name>

<value>MyIpAddress:2181</value>

</property>


The only place where I mentionned /solr is at the indexer creation using :


./hbase-indexer add-indexer -n myindexer -c ../Fred_Indexer/indexdemo-indexer.xml -cp solr.zk=MyIpAddress:2181/solr -cp solr.collection=collection1


Any idea ?

Fred


________________________________
De : Dima Spivak <[hidden email]>
Envoyé : mercredi 7 juin 2017 16:54
À : [hidden email]
Objet : Re: Hbase indexer for SOLR

The :2181/solr config looks suspect to me, but as Busbey points out,
questions of how to successfully set up Lily are probably better suited for
their user mailing list.

On Wed, Jun 7, 2017 at 7:41 AM F. T. <[hidden email]> wrote:

> Thanks for for your answer. That's it, I use the Lily Hbase Indexer. As
> told before, I can create/delete an indexer. I can launch Hbase-indexer
> server command. It's stable until I insert a row into Hbase.
>
> Here is msg I get :
>
>
> 17/06/07 16:10:23 INFO mortbay.log: Started
> SelectChannelConnector@0.0.0.0:11060
> 17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port:
> 37592 with version info: version: "1.2.3" url: "git://
> kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision:
> "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon Aug 29
> 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client connection,
> connectString=MyIpAddress:2181/solr sessionTimeout=30000
> watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection to
> server MyServer/MyIpAddress:2181. Will not attempt to authenticate using
> SASL (unknown error)
> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection established
> to MyServer/MyIpAddress:2181, initiating session
> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment
> complete on server MyServer/MyIpAddress:2181, sessionid =
> 0x15c81c312060085, negotiated timeout = 30000
> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085
> closed
> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
> 17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing event
> java.lang.RuntimeException: org.apache.solr.common.SolrException: Cannot
> connect to cluster at MyIpAddress:2181/solr: cluster not found/not ready
>
>
> Thanks again
>
> Fred
>
>
>
> ________________________________
> De : Sean Busbey <[hidden email]>
> Envoyé : mercredi 7 juin 2017 16:30
> À : [hidden email]
> Objet : Re: Hbase indexer for SOLR
>
> Hi Fred!
>
> Unfortunately, that sounds like an issue on the Solr side.
>
> The HBase community does not maintain any integration with solr, do
> you know which hbase indexer project in particular you're using?
>
> The most common one I've seen show up is the "Lily HBase Indexer" from
> NGData[1]. If that's the one you're using, they have a user mailing
> list[2] that can hopefully help sort things out.
>
>
> [1]:  https://ngdata.github.io/hbase-indexer/

Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.io/hbase-indexer/>
ngdata.github.io
Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and easily search for any content stored in HBase. It allows you to quickly and easily index ...



>
> Lily HBase Indexer by NGDATA - GitHub Pages<
> https://ngdata.github.io/hbase-indexer/>

Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.io/hbase-indexer/>
ngdata.github.io
Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and easily search for any content stored in HBase. It allows you to quickly and easily index ...



> ngdata.github.io
> Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and
> easily search for any content stored in HBase. It allows you to quickly and
> easily index ...
>
>
>
> [2]: https://groups.google.com/forum/#!forum/hbase-indexer-user

[http://www.google.com/images/icons/product/groups-128.png]<https://groups.google.com/forum/#!forum/hbase-indexer-user>

HBase Indexer Users<https://groups.google.com/forum/#!forum/hbase-indexer-user>
groups.google.com
Google Group



>
> On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> > Hi all,
> > I want to extract fields from PDF files and store it into Hbase. Then I
> want to link the database with a Solr collection. To do this, I installed
> Hbase-indexer.
> > What is the best way to do it ?
> > Actually, I can write data into Hbase but not into my Solr collection.
> When I launch Hbase-indexer server and when I insert rows into my Hbase
> table, I get some errors :
> > Cannot connect to cluster at myIPaddress:2181/solr: cluster not
> found/not ready
> > Somebody to help me ? Thanks in advance.
> > Fred
> >
>
--
-Dima
Reply | Threaded
Open this post in threaded view
|

RE: Hbase indexer for SOLR

F. T.
In reply to this post by Steen Manniche
I followed this instructions : https://github.com/NGDATA/hbase-indexer/wiki/Installation

Could you tell me what is supect for you in :2181/solr ? And help me to find what is wrong ?

Can you explain :  it is trying to connect to a ZooKeeper instance ? I think the Zookeeper intance running is an embeded  instance provided by HBASE (not sure). Is it the best way to do or is it better to launch Zookeeper "alone" (zkServer.sh start) ?

Thanks in advance

Fred

________________________________
De : Steen Manniche <[hidden email]>
Envoyé : mercredi 7 juin 2017 19:13
À : [hidden email]
Objet : Re: Hbase indexer for SOLR

It looks like it is trying to connect to a ZooKeeper instance. Have you set
up your environment according to the instructions on the lily website?

Br,

Den 7. jun. 2017 4.54 PM skrev "Dima Spivak" <[hidden email]>:

> The :2181/solr config looks suspect to me, but as Busbey points out,
> questions of how to successfully set up Lily are probably better suited for
> their user mailing list.
>
> On Wed, Jun 7, 2017 at 7:41 AM F. T. <[hidden email]> wrote:
>
> > Thanks for for your answer. That's it, I use the Lily Hbase Indexer. As
> > told before, I can create/delete an indexer. I can launch Hbase-indexer
> > server command. It's stable until I insert a row into Hbase.
> >
> > Here is msg I get :
> >
> >
> > 17/06/07 16:10:23 INFO mortbay.log: Started
> > SelectChannelConnector@0.0.0.0:11060
> > 17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port:
> > 37592 with version info: version: "1.2.3" url: "git://
> > kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision:
> > "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon Aug
> 29
> > 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
> > 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client connection,
> > connectString=MyIpAddress:2181/solr sessionTimeout=30000
> > watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
> > 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection to
> > server MyServer/MyIpAddress:2181. Will not attempt to authenticate using
> > SASL (unknown error)
> > 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection
> established
> > to MyServer/MyIpAddress:2181, initiating session
> > 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment
> > complete on server MyServer/MyIpAddress:2181, sessionid =
> > 0x15c81c312060085, negotiated timeout = 30000
> > 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085
> > closed
> > 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
> > 17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing
> event
> > java.lang.RuntimeException: org.apache.solr.common.SolrException: Cannot
> > connect to cluster at MyIpAddress:2181/solr: cluster not found/not ready
> >
> >
> > Thanks again
> >
> > Fred
> >
> >
> >
> > ________________________________
> > De : Sean Busbey <[hidden email]>
> > Envoyé : mercredi 7 juin 2017 16:30
> > À : [hidden email]
> > Objet : Re: Hbase indexer for SOLR
> >
> > Hi Fred!
> >
> > Unfortunately, that sounds like an issue on the Solr side.
> >
> > The HBase community does not maintain any integration with solr, do
> > you know which hbase indexer project in particular you're using?
> >
> > The most common one I've seen show up is the "Lily HBase Indexer" from
> > NGData[1]. If that's the one you're using, they have a user mailing
> > list[2] that can hopefully help sort things out.
> >
> >
> > [1]:  https://ngdata.github.io/hbase-indexer/

Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.io/hbase-indexer/>
ngdata.github.io
Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and easily search for any content stored in HBase. It allows you to quickly and easily index ...



> >
> > Lily HBase Indexer by NGDATA - GitHub Pages<
> > https://ngdata.github.io/hbase-indexer/>

Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.io/hbase-indexer/>
ngdata.github.io
Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and easily search for any content stored in HBase. It allows you to quickly and easily index ...



> > ngdata.github.io
> > Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> and
> > easily search for any content stored in HBase. It allows you to quickly
> and
> > easily index ...
> >
> >
> >
> > [2]: https://groups.google.com/forum/#!forum/hbase-indexer-user

[http://www.google.com/images/icons/product/groups-128.png]<https://groups.google.com/forum/#!forum/hbase-indexer-user>

HBase Indexer Users<https://groups.google.com/forum/#!forum/hbase-indexer-user>
groups.google.com
Google Group



> >
> > On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> > > Hi all,
> > > I want to extract fields from PDF files and store it into Hbase. Then I
> > want to link the database with a Solr collection. To do this, I installed
> > Hbase-indexer.
> > > What is the best way to do it ?
> > > Actually, I can write data into Hbase but not into my Solr collection.
> > When I launch Hbase-indexer server and when I insert rows into my Hbase
> > table, I get some errors :
> > > Cannot connect to cluster at myIPaddress:2181/solr: cluster not
> > found/not ready
> > > Somebody to help me ? Thanks in advance.
> > > Fred
> > >
> >
> --
> -Dima
>
Reply | Threaded
Open this post in threaded view
|

Re: Hbase indexer for SOLR

Sean Busbey-2
In reply to this post by F. T.
Hi Fred!

Are you sure your message went through? The last message I see on the
lily hbase indexer user list is from May 21st.

As we mentioned before, the HBase project isn't familiar with the lily
hbase indexer; it's a third party project unrelated to our community.
I'm afraid we're unlikely to be of much help.

Since that particular step is titled "Start Solr" it's possible the
solr user mailing list might be able to help[1]. Just keep in mind
it's very likely they will also point you back towards the NGData
maintained user list.

[1]: https://lists.apache.org/list.html?solr-user@...


On Thu, Jun 8, 2017 at 1:38 AM, F. T. <[hidden email]> wrote:

> Hi Dima,
>
> I didn't find any help on their user mailing list about this problem. And I don't know where the :2181/solr comes from.
>
> Here is my "hbase-indexer-site.xml" configuration file :
>
>
> <property>
>
> <name>hbase.zookeeper.quorum</name>
>
> <value>MyIpAddress</value>
>
> </property>
>
>
>
> <property>
>
> <name>hbaseindexer.zookeeper.connectstring</name>
>
> <value>MyIpAddress:2181</value>
>
> </property>
>
>
> The only place where I mentionned /solr is at the indexer creation using :
>
>
> ./hbase-indexer add-indexer -n myindexer -c ../Fred_Indexer/indexdemo-indexer.xml -cp solr.zk=MyIpAddress:2181/solr -cp solr.collection=collection1
>
>
> Any idea ?
>
> Fred
>
>
> ________________________________
> De : Dima Spivak <[hidden email]>
> Envoyé : mercredi 7 juin 2017 16:54
> À : [hidden email]
> Objet : Re: Hbase indexer for SOLR
>
> The :2181/solr config looks suspect to me, but as Busbey points out,
> questions of how to successfully set up Lily are probably better suited for
> their user mailing list.
>
> On Wed, Jun 7, 2017 at 7:41 AM F. T. <[hidden email]> wrote:
>
>> Thanks for for your answer. That's it, I use the Lily Hbase Indexer. As
>> told before, I can create/delete an indexer. I can launch Hbase-indexer
>> server command. It's stable until I insert a row into Hbase.
>>
>> Here is msg I get :
>>
>>
>> 17/06/07 16:10:23 INFO mortbay.log: Started
>> SelectChannelConnector@0.0.0.0:11060
>> 17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port:
>> 37592 with version info: version: "1.2.3" url: "git://
>> kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision:
>> "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon Aug 29
>> 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
>> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client connection,
>> connectString=MyIpAddress:2181/solr sessionTimeout=30000
>> watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
>> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection to
>> server MyServer/MyIpAddress:2181. Will not attempt to authenticate using
>> SASL (unknown error)
>> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection established
>> to MyServer/MyIpAddress:2181, initiating session
>> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment
>> complete on server MyServer/MyIpAddress:2181, sessionid =
>> 0x15c81c312060085, negotiated timeout = 30000
>> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085
>> closed
>> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
>> 17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing event
>> java.lang.RuntimeException: org.apache.solr.common.SolrException: Cannot
>> connect to cluster at MyIpAddress:2181/solr: cluster not found/not ready
>>
>>
>> Thanks again
>>
>> Fred
>>
>>
>>
>> ________________________________
>> De : Sean Busbey <[hidden email]>
>> Envoyé : mercredi 7 juin 2017 16:30
>> À : [hidden email]
>> Objet : Re: Hbase indexer for SOLR
>>
>> Hi Fred!
>>
>> Unfortunately, that sounds like an issue on the Solr side.
>>
>> The HBase community does not maintain any integration with solr, do
>> you know which hbase indexer project in particular you're using?
>>
>> The most common one I've seen show up is the "Lily HBase Indexer" from
>> NGData[1]. If that's the one you're using, they have a user mailing
>> list[2] that can hopefully help sort things out.
>>
>>
>> [1]:  https://ngdata.github.io/hbase-indexer/
>
> Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.io/hbase-indexer/>
> ngdata.github.io
> Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and easily search for any content stored in HBase. It allows you to quickly and easily index ...
>
>
>
>>
>> Lily HBase Indexer by NGDATA - GitHub Pages<
>> https://ngdata.github.io/hbase-indexer/>
>
> Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.io/hbase-indexer/>
> ngdata.github.io
> Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and easily search for any content stored in HBase. It allows you to quickly and easily index ...
>
>
>
>> ngdata.github.io
>> Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly and
>> easily search for any content stored in HBase. It allows you to quickly and
>> easily index ...
>>
>>
>>
>> [2]: https://groups.google.com/forum/#!forum/hbase-indexer-user
>
> [http://www.google.com/images/icons/product/groups-128.png]<https://groups.google.com/forum/#!forum/hbase-indexer-user>
>
> HBase Indexer Users<https://groups.google.com/forum/#!forum/hbase-indexer-user>
> groups.google.com
> Google Group
>
>
>
>>
>> On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
>> > Hi all,
>> > I want to extract fields from PDF files and store it into Hbase. Then I
>> want to link the database with a Solr collection. To do this, I installed
>> Hbase-indexer.
>> > What is the best way to do it ?
>> > Actually, I can write data into Hbase but not into my Solr collection.
>> When I launch Hbase-indexer server and when I insert rows into my Hbase
>> table, I get some errors :
>> > Cannot connect to cluster at myIPaddress:2181/solr: cluster not
>> found/not ready
>> > Somebody to help me ? Thanks in advance.
>> > Fred
>> >
>>
> --
> -Dima
Reply | Threaded
Open this post in threaded view
|

Re: Hbase indexer for SOLR

Dima Spivak-3
Hey Fred,

Sorry we can't be more helpful, it's just that you're asking about a piece
of software that we don't use :(. A quick search turned up Cloudera's docs
on using Lily [1], so maybe give those a shot and then direct questions to
the Lily list Busbey pointed to if things are still misbehaving?

[1]
https://www.cloudera.com/documentation/enterprise/latest/topics/search_use_hbase_indexer_service.html

-Dima

On Thu, Jun 8, 2017 at 8:04 AM, Sean Busbey <[hidden email]> wrote:

> Hi Fred!
>
> Are you sure your message went through? The last message I see on the
> lily hbase indexer user list is from May 21st.
>
> As we mentioned before, the HBase project isn't familiar with the lily
> hbase indexer; it's a third party project unrelated to our community.
> I'm afraid we're unlikely to be of much help.
>
> Since that particular step is titled "Start Solr" it's possible the
> solr user mailing list might be able to help[1]. Just keep in mind
> it's very likely they will also point you back towards the NGData
> maintained user list.
>
> [1]: https://lists.apache.org/list.html?solr-user@...
>
>
> On Thu, Jun 8, 2017 at 1:38 AM, F. T. <[hidden email]> wrote:
> > Hi Dima,
> >
> > I didn't find any help on their user mailing list about this problem.
> And I don't know where the :2181/solr comes from.
> >
> > Here is my "hbase-indexer-site.xml" configuration file :
> >
> >
> > <property>
> >
> > <name>hbase.zookeeper.quorum</name>
> >
> > <value>MyIpAddress</value>
> >
> > </property>
> >
> >
> >
> > <property>
> >
> > <name>hbaseindexer.zookeeper.connectstring</name>
> >
> > <value>MyIpAddress:2181</value>
> >
> > </property>
> >
> >
> > The only place where I mentionned /solr is at the indexer creation using
> :
> >
> >
> > ./hbase-indexer add-indexer -n myindexer -c ../Fred_Indexer/indexdemo-indexer.xml
> -cp solr.zk=MyIpAddress:2181/solr -cp solr.collection=collection1
> >
> >
> > Any idea ?
> >
> > Fred
> >
> >
> > ________________________________
> > De : Dima Spivak <[hidden email]>
> > Envoyé : mercredi 7 juin 2017 16:54
> > À : [hidden email]
> > Objet : Re: Hbase indexer for SOLR
> >
> > The :2181/solr config looks suspect to me, but as Busbey points out,
> > questions of how to successfully set up Lily are probably better suited
> for
> > their user mailing list.
> >
> > On Wed, Jun 7, 2017 at 7:41 AM F. T. <[hidden email]> wrote:
> >
> >> Thanks for for your answer. That's it, I use the Lily Hbase Indexer. As
> >> told before, I can create/delete an indexer. I can launch Hbase-indexer
> >> server command. It's stable until I insert a row into Hbase.
> >>
> >> Here is msg I get :
> >>
> >>
> >> 17/06/07 16:10:23 INFO mortbay.log: Started
> >> SelectChannelConnector@0.0.0.0:11060
> >> 17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port:
> >> 37592 with version info: version: "1.2.3" url: "git://
> >> kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision:
> >> "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon
> Aug 29
> >> 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
> >> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client
> connection,
> >> connectString=MyIpAddress:2181/solr sessionTimeout=30000
> >> watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
> >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection
> to
> >> server MyServer/MyIpAddress:2181. Will not attempt to authenticate using
> >> SASL (unknown error)
> >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection
> established
> >> to MyServer/MyIpAddress:2181, initiating session
> >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment
> >> complete on server MyServer/MyIpAddress:2181, sessionid =
> >> 0x15c81c312060085, negotiated timeout = 30000
> >> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085
> >> closed
> >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
> >> 17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing
> event
> >> java.lang.RuntimeException: org.apache.solr.common.SolrException:
> Cannot
> >> connect to cluster at MyIpAddress:2181/solr: cluster not found/not ready
> >>
> >>
> >> Thanks again
> >>
> >> Fred
> >>
> >>
> >>
> >> ________________________________
> >> De : Sean Busbey <[hidden email]>
> >> Envoyé : mercredi 7 juin 2017 16:30
> >> À : [hidden email]
> >> Objet : Re: Hbase indexer for SOLR
> >>
> >> Hi Fred!
> >>
> >> Unfortunately, that sounds like an issue on the Solr side.
> >>
> >> The HBase community does not maintain any integration with solr, do
> >> you know which hbase indexer project in particular you're using?
> >>
> >> The most common one I've seen show up is the "Lily HBase Indexer" from
> >> NGData[1]. If that's the one you're using, they have a user mailing
> >> list[2] that can hopefully help sort things out.
> >>
> >>
> >> [1]:  https://ngdata.github.io/hbase-indexer/
> >
> > Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.
> io/hbase-indexer/>
> > ngdata.github.io
> > Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> and easily search for any content stored in HBase. It allows you to quickly
> and easily index ...
> >
> >
> >
> >>
> >> Lily HBase Indexer by NGDATA - GitHub Pages<
> >> https://ngdata.github.io/hbase-indexer/>
> >
> > Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.
> io/hbase-indexer/>
> > ngdata.github.io
> > Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> and easily search for any content stored in HBase. It allows you to quickly
> and easily index ...
> >
> >
> >
> >> ngdata.github.io
> >> Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> and
> >> easily search for any content stored in HBase. It allows you to quickly
> and
> >> easily index ...
> >>
> >>
> >>
> >> [2]: https://groups.google.com/forum/#!forum/hbase-indexer-user
> >
> > [http://www.google.com/images/icons/product/groups-128.png]<
> https://groups.google.com/forum/#!forum/hbase-indexer-user>
> >
> > HBase Indexer Users<https://groups.google.com/forum/#!forum/hbase-
> indexer-user>
> > groups.google.com
> > Google Group
> >
> >
> >
> >>
> >> On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> >> > Hi all,
> >> > I want to extract fields from PDF files and store it into Hbase. Then
> I
> >> want to link the database with a Solr collection. To do this, I
> installed
> >> Hbase-indexer.
> >> > What is the best way to do it ?
> >> > Actually, I can write data into Hbase but not into my Solr collection.
> >> When I launch Hbase-indexer server and when I insert rows into my Hbase
> >> table, I get some errors :
> >> > Cannot connect to cluster at myIPaddress:2181/solr: cluster not
> >> found/not ready
> >> > Somebody to help me ? Thanks in advance.
> >> > Fred
> >> >
> >>
> > --
> > -Dima
>
Reply | Threaded
Open this post in threaded view
|

Re: Hbase indexer for SOLR

Jean-Marc Spaggiari
Hi Fred,

Just run your own ZK instance and not the embedded one. It's pretty small
and easy to start. Make sure SOLR is configured to use /SOLR in ZK and not /

You can try to use "hbase zkcli" to see if you ZK server is running wall
(and that HBase can talk to it well)

JMS

2017-06-08 12:37 GMT-04:00 Dima Spivak <[hidden email]>:

> Hey Fred,
>
> Sorry we can't be more helpful, it's just that you're asking about a piece
> of software that we don't use :(. A quick search turned up Cloudera's docs
> on using Lily [1], so maybe give those a shot and then direct questions to
> the Lily list Busbey pointed to if things are still misbehaving?
>
> [1]
> https://www.cloudera.com/documentation/enterprise/
> latest/topics/search_use_hbase_indexer_service.html
>
> -Dima
>
> On Thu, Jun 8, 2017 at 8:04 AM, Sean Busbey <[hidden email]> wrote:
>
> > Hi Fred!
> >
> > Are you sure your message went through? The last message I see on the
> > lily hbase indexer user list is from May 21st.
> >
> > As we mentioned before, the HBase project isn't familiar with the lily
> > hbase indexer; it's a third party project unrelated to our community.
> > I'm afraid we're unlikely to be of much help.
> >
> > Since that particular step is titled "Start Solr" it's possible the
> > solr user mailing list might be able to help[1]. Just keep in mind
> > it's very likely they will also point you back towards the NGData
> > maintained user list.
> >
> > [1]: https://lists.apache.org/list.html?solr-user@...
> >
> >
> > On Thu, Jun 8, 2017 at 1:38 AM, F. T. <[hidden email]> wrote:
> > > Hi Dima,
> > >
> > > I didn't find any help on their user mailing list about this problem.
> > And I don't know where the :2181/solr comes from.
> > >
> > > Here is my "hbase-indexer-site.xml" configuration file :
> > >
> > >
> > > <property>
> > >
> > > <name>hbase.zookeeper.quorum</name>
> > >
> > > <value>MyIpAddress</value>
> > >
> > > </property>
> > >
> > >
> > >
> > > <property>
> > >
> > > <name>hbaseindexer.zookeeper.connectstring</name>
> > >
> > > <value>MyIpAddress:2181</value>
> > >
> > > </property>
> > >
> > >
> > > The only place where I mentionned /solr is at the indexer creation
> using
> > :
> > >
> > >
> > > ./hbase-indexer add-indexer -n myindexer -c ../Fred_Indexer/indexdemo-
> indexer.xml
> > -cp solr.zk=MyIpAddress:2181/solr -cp solr.collection=collection1
> > >
> > >
> > > Any idea ?
> > >
> > > Fred
> > >
> > >
> > > ________________________________
> > > De : Dima Spivak <[hidden email]>
> > > Envoyé : mercredi 7 juin 2017 16:54
> > > À : [hidden email]
> > > Objet : Re: Hbase indexer for SOLR
> > >
> > > The :2181/solr config looks suspect to me, but as Busbey points out,
> > > questions of how to successfully set up Lily are probably better suited
> > for
> > > their user mailing list.
> > >
> > > On Wed, Jun 7, 2017 at 7:41 AM F. T. <[hidden email]> wrote:
> > >
> > >> Thanks for for your answer. That's it, I use the Lily Hbase Indexer.
> As
> > >> told before, I can create/delete an indexer. I can launch
> Hbase-indexer
> > >> server command. It's stable until I insert a row into Hbase.
> > >>
> > >> Here is msg I get :
> > >>
> > >>
> > >> 17/06/07 16:10:23 INFO mortbay.log: Started
> > >> SelectChannelConnector@0.0.0.0:11060
> > >> 17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port:
> > >> 37592 with version info: version: "1.2.3" url: "git://
> > >> kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision:
> > >> "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon
> > Aug 29
> > >> 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
> > >> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client
> > connection,
> > >> connectString=MyIpAddress:2181/solr sessionTimeout=30000
> > >> watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
> > >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection
> > to
> > >> server MyServer/MyIpAddress:2181. Will not attempt to authenticate
> using
> > >> SASL (unknown error)
> > >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection
> > established
> > >> to MyServer/MyIpAddress:2181, initiating session
> > >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment
> > >> complete on server MyServer/MyIpAddress:2181, sessionid =
> > >> 0x15c81c312060085, negotiated timeout = 30000
> > >> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085
> > >> closed
> > >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
> > >> 17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing
> > event
> > >> java.lang.RuntimeException: org.apache.solr.common.SolrException:
> > Cannot
> > >> connect to cluster at MyIpAddress:2181/solr: cluster not found/not
> ready
> > >>
> > >>
> > >> Thanks again
> > >>
> > >> Fred
> > >>
> > >>
> > >>
> > >> ________________________________
> > >> De : Sean Busbey <[hidden email]>
> > >> Envoyé : mercredi 7 juin 2017 16:30
> > >> À : [hidden email]
> > >> Objet : Re: Hbase indexer for SOLR
> > >>
> > >> Hi Fred!
> > >>
> > >> Unfortunately, that sounds like an issue on the Solr side.
> > >>
> > >> The HBase community does not maintain any integration with solr, do
> > >> you know which hbase indexer project in particular you're using?
> > >>
> > >> The most common one I've seen show up is the "Lily HBase Indexer" from
> > >> NGData[1]. If that's the one you're using, they have a user mailing
> > >> list[2] that can hopefully help sort things out.
> > >>
> > >>
> > >> [1]:  https://ngdata.github.io/hbase-indexer/
> > >
> > > Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.
> > io/hbase-indexer/>
> > > ngdata.github.io
> > > Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> > and easily search for any content stored in HBase. It allows you to
> quickly
> > and easily index ...
> > >
> > >
> > >
> > >>
> > >> Lily HBase Indexer by NGDATA - GitHub Pages<
> > >> https://ngdata.github.io/hbase-indexer/>
> > >
> > > Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.
> > io/hbase-indexer/>
> > > ngdata.github.io
> > > Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> > and easily search for any content stored in HBase. It allows you to
> quickly
> > and easily index ...
> > >
> > >
> > >
> > >> ngdata.github.io
> > >> Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> > and
> > >> easily search for any content stored in HBase. It allows you to
> quickly
> > and
> > >> easily index ...
> > >>
> > >>
> > >>
> > >> [2]: https://groups.google.com/forum/#!forum/hbase-indexer-user
> > >
> > > [http://www.google.com/images/icons/product/groups-128.png]<
> > https://groups.google.com/forum/#!forum/hbase-indexer-user>
> > >
> > > HBase Indexer Users<https://groups.google.com/forum/#!forum/hbase-
> > indexer-user>
> > > groups.google.com
> > > Google Group
> > >
> > >
> > >
> > >>
> > >> On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> > >> > Hi all,
> > >> > I want to extract fields from PDF files and store it into Hbase.
> Then
> > I
> > >> want to link the database with a Solr collection. To do this, I
> > installed
> > >> Hbase-indexer.
> > >> > What is the best way to do it ?
> > >> > Actually, I can write data into Hbase but not into my Solr
> collection.
> > >> When I launch Hbase-indexer server and when I insert rows into my
> Hbase
> > >> table, I get some errors :
> > >> > Cannot connect to cluster at myIPaddress:2181/solr: cluster not
> > >> found/not ready
> > >> > Somebody to help me ? Thanks in advance.
> > >> > Fred
> > >> >
> > >>
> > > --
> > > -Dima
> >
>
Reply | Threaded
Open this post in threaded view
|

RE : Hbase indexer for SOLR

F. T.
Hi Jean Marc
Very kind of you tout answer even if I am on the wrong forum. It seems good now, no more error messages when I write into my hbase table. But it does not write into my collection.
I just created my collection with the command create -c mycollection. I'll try another way to create it tomorrow. Excepted if you 've got an idéal ... maybe the solution is to run Zookeeper not in embedded mode like you suggested.
Thanks again
Fred


Envoyé de mon Galaxy S6 Orange


-------- Message d'origine --------
De : Jean-Marc Spaggiari <[hidden email]>
Date : 08/06/2017 18:49 (GMT+01:00)
À : user <[hidden email]>
Objet : Re: Hbase indexer for SOLR

Hi Fred,

Just run your own ZK instance and not the embedded one. It's pretty small
and easy to start. Make sure SOLR is configured to use /SOLR in ZK and not /

You can try to use "hbase zkcli" to see if you ZK server is running wall
(and that HBase can talk to it well)

JMS

2017-06-08 12:37 GMT-04:00 Dima Spivak <[hidden email]>:

> Hey Fred,
>
> Sorry we can't be more helpful, it's just that you're asking about a piece
> of software that we don't use :(. A quick search turned up Cloudera's docs
> on using Lily [1], so maybe give those a shot and then direct questions to
> the Lily list Busbey pointed to if things are still misbehaving?
>
> [1]
> https://www.cloudera.com/documentation/enterprise/
> latest/topics/search_use_hbase_indexer_service.html
>
> -Dima
>
> On Thu, Jun 8, 2017 at 8:04 AM, Sean Busbey <[hidden email]> wrote:
>
> > Hi Fred!
> >
> > Are you sure your message went through? The last message I see on the
> > lily hbase indexer user list is from May 21st.
> >
> > As we mentioned before, the HBase project isn't familiar with the lily
> > hbase indexer; it's a third party project unrelated to our community.
> > I'm afraid we're unlikely to be of much help.
> >
> > Since that particular step is titled "Start Solr" it's possible the
> > solr user mailing list might be able to help[1]. Just keep in mind
> > it's very likely they will also point you back towards the NGData
> > maintained user list.
> >
> > [1]: https://lists.apache.org/list.html?solr-user@...
> >
> >
> > On Thu, Jun 8, 2017 at 1:38 AM, F. T. <[hidden email]> wrote:
> > > Hi Dima,
> > >
> > > I didn't find any help on their user mailing list about this problem.
> > And I don't know where the :2181/solr comes from.
> > >
> > > Here is my "hbase-indexer-site.xml" configuration file :
> > >
> > >
> > > <property>
> > >
> > > <name>hbase.zookeeper.quorum</name>
> > >
> > > <value>MyIpAddress</value>
> > >
> > > </property>
> > >
> > >
> > >
> > > <property>
> > >
> > > <name>hbaseindexer.zookeeper.connectstring</name>
> > >
> > > <value>MyIpAddress:2181</value>
> > >
> > > </property>
> > >
> > >
> > > The only place where I mentionned /solr is at the indexer creation
> using
> > :
> > >
> > >
> > > ./hbase-indexer add-indexer -n myindexer -c ../Fred_Indexer/indexdemo-
> indexer.xml
> > -cp solr.zk=MyIpAddress:2181/solr -cp solr.collection=collection1
> > >
> > >
> > > Any idea ?
> > >
> > > Fred
> > >
> > >
> > > ________________________________
> > > De : Dima Spivak <[hidden email]>
> > > Envoyé : mercredi 7 juin 2017 16:54
> > > À : [hidden email]
> > > Objet : Re: Hbase indexer for SOLR
> > >
> > > The :2181/solr config looks suspect to me, but as Busbey points out,
> > > questions of how to successfully set up Lily are probably better suited
> > for
> > > their user mailing list.
> > >
> > > On Wed, Jun 7, 2017 at 7:41 AM F. T. <[hidden email]> wrote:
> > >
> > >> Thanks for for your answer. That's it, I use the Lily Hbase Indexer.
> As
> > >> told before, I can create/delete an indexer. I can launch
> Hbase-indexer
> > >> server command. It's stable until I insert a row into Hbase.
> > >>
> > >> Here is msg I get :
> > >>
> > >>
> > >> 17/06/07 16:10:23 INFO mortbay.log: Started
> > >> SelectChannelConnector@0.0.0.0:11060
> > >> 17/06/07 16:12:41 INFO hbase.Server: Connection from MyIpAddress port:
> > >> 37592 with version info: version: "1.2.3" url: "git://
> > >> kalashnikov.att.net/Users/stack/checkouts/hbase.git.commit" revision:
> > >> "bd63744624a26dc3350137b564fe746df7a721a4" user: "stack" date: "Mon
> > Aug 29
> > >> 15:13:42 PDT 2016" src_checksum: "0ca49367ef6c3a680888bbc4f1485d18"
> > >> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Initiating client
> > connection,
> > >> connectString=MyIpAddress:2181/solr sessionTimeout=30000
> > >> watcher=org.apache.solr.common.cloud.SolrZkClient$3@12dd2209
> > >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Opening socket connection
> > to
> > >> server MyServer/MyIpAddress:2181. Will not attempt to authenticate
> using
> > >> SASL (unknown error)
> > >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Socket connection
> > established
> > >> to MyServer/MyIpAddress:2181, initiating session
> > >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: Session establishment
> > >> complete on server MyServer/MyIpAddress:2181, sessionid =
> > >> 0x15c81c312060085, negotiated timeout = 30000
> > >> 17/06/07 16:12:42 INFO zookeeper.ZooKeeper: Session: 0x15c81c312060085
> > >> closed
> > >> 17/06/07 16:12:42 INFO zookeeper.ClientCnxn: EventThread shut down
> > >> 17/06/07 16:12:42 ERROR impl.SepEventExecutor: Error while processing
> > event
> > >> java.lang.RuntimeException: org.apache.solr.common.SolrException:
> > Cannot
> > >> connect to cluster at MyIpAddress:2181/solr: cluster not found/not
> ready
> > >>
> > >>
> > >> Thanks again
> > >>
> > >> Fred
> > >>
> > >>
> > >>
> > >> ________________________________
> > >> De : Sean Busbey <[hidden email]>
> > >> Envoyé : mercredi 7 juin 2017 16:30
> > >> À : [hidden email]
> > >> Objet : Re: Hbase indexer for SOLR
> > >>
> > >> Hi Fred!
> > >>
> > >> Unfortunately, that sounds like an issue on the Solr side.
> > >>
> > >> The HBase community does not maintain any integration with solr, do
> > >> you know which hbase indexer project in particular you're using?
> > >>
> > >> The most common one I've seen show up is the "Lily HBase Indexer" from
> > >> NGData[1]. If that's the one you're using, they have a user mailing
> > >> list[2] that can hopefully help sort things out.
> > >>
> > >>
> > >> [1]:  https://ngdata.github.io/hbase-indexer/
> > >
> > > Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.
> > io/hbase-indexer/>
> > > ngdata.github.io
> > > Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> > and easily search for any content stored in HBase. It allows you to
> quickly
> > and easily index ...
> > >
> > >
> > >
> > >>
> > >> Lily HBase Indexer by NGDATA - GitHub Pages<
> > >> https://ngdata.github.io/hbase-indexer/>
> > >
> > > Lily HBase Indexer by NGDATA - GitHub Pages<https://ngdata.github.
> > io/hbase-indexer/>
> > > ngdata.github.io
> > > Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> > and easily search for any content stored in HBase. It allows you to
> quickly
> > and easily index ...
> > >
> > >
> > >
> > >> ngdata.github.io
> > >> Lily HBase Indexer. Lily HBase Indexer provides the ability to quickly
> > and
> > >> easily search for any content stored in HBase. It allows you to
> quickly
> > and
> > >> easily index ...
> > >>
> > >>
> > >>
> > >> [2]: https://groups.google.com/forum/#!forum/hbase-indexer-user
> > >
> > > [http://www.google.com/images/icons/product/groups-128.png]<
> > https://groups.google.com/forum/#!forum/hbase-indexer-user>
> > >
> > > HBase Indexer Users<https://groups.google.com/forum/#!forum/hbase-
> > indexer-user>
> > > groups.google.com
> > > Google Group
> > >
> > >
> > >
> > >>
> > >> On Wed, Jun 7, 2017 at 9:06 AM, F. T. <[hidden email]> wrote:
> > >> > Hi all,
> > >> > I want to extract fields from PDF files and store it into Hbase.
> Then
> > I
> > >> want to link the database with a Solr collection. To do this, I
> > installed
> > >> Hbase-indexer.
> > >> > What is the best way to do it ?
> > >> > Actually, I can write data into Hbase but not into my Solr
> collection.
> > >> When I launch Hbase-indexer server and when I insert rows into my
> Hbase
> > >> table, I get some errors :
> > >> > Cannot connect to cluster at myIPaddress:2181/solr: cluster not
> > >> found/not ready
> > >> > Somebody to help me ? Thanks in advance.
> > >> > Fred
> > >> >
> > >>
> > > --
> > > -Dima
> >
>